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ABSTRACT 


The  duality  model  for  convex  programming  studied  recently  by 


E.  L.  Peterson  is  analyzed  from  the  viewpoint  of  perturbational  duality 


theory.  Relationships  with  the  traditional  Lagrangian  model  for  ordinary 


programming  are  explored  in  detail,  with  particular  emphasis  placed  on  the 


respective  dual  problems,  Kuhn-Tucker  vectors,  and  extremality  conditions 


The  case  of  homogeneous  constraints  is  discussed  by  way  of  illustration 


The  Slater  existence  criterion  for  optimal  Lagrange  multipliers  in  ordinary 


programming  is  sharpened  for  the  case  in  which  some  of  the  functions  are 


polyhedral.  The  analysis  generally  covers  nonclosed  functions  on  general 


spaces  and  includes  refinements  to  exploit  polyhedral ity  in  the  finite 


dimensional  case.  Underlying  the  whole  development  are  basic  technical  facts 


which  are  developed  concerning  the  Fenchel  conjugate  and  preconjugate  of  the 


indicator  function  of  an  epigraph  set 
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SIGNIFICANCE  AND  EXPLANATION 


Linear  proqramming  has  become  an  indispensible  tool  in  solving  managerial 
and  engineering  optimization  problems,  but  in  order  to  apply  linear  programming 
it  is  necessary  to  make  simplifying  assumptions  since  most  real-1 ite  problems 
are  inherently  nonlinear.  Nonlinear  problems  are  an  order  of  magnitude  more 
difficult  than  linear  ones,  and  it  is  only  comparatively  recently  that  it  has 
been  possible  to  develop  a theoretical  understanding  of  nonlinear  programming 
comparable  to  the  well-established  theory  for  the  linear  case. 

Given  any  linear  progranming  problem,  a second  "dual"  linear  programming 
problem  exists  which  plays  an  essential  role  in  both  theoretical  and  practical 
solution  of  this  class  of  problem.  Given  a nonlinear  programming  problem, 
dual  problems  also  exist,  which  can  be  formulated  in  various  ways.  This  is 
the  context  of  the  present  paper. 

This  paper  centers  around  the  theoretical  analysis  of  the  nonlinear 
programming  problem  of  minimizing  subject  to  side  conditions  f Mx)  ^ 0 

(i  = l,...,m),  where  all  the  functions  are  convex,  A broadened  form  of  the 
problem,  introduced  and  studied  recently  by  E.  L.  Peterson  under  the  name  of 
"generalized  geometric  programming",  is  analyzed  here  in  detail  and  extended 
to  general  spaces.  For  this  broadened  problem  an  alternate  but  essentially 
equivalent  formulation,  or  "dual  problem",  exists  which  has  the  same  structural 
form.  When  specialized  to  the  nonlinear  program  above,  a dual  problem  results 
which  differs  slightly  from  the  widely  used  dual  involving  finding  optimal 
Lagrange  multipliers.  Considerable  attention  is  given  to  comparing  and 
contrasting  these  two  dual  approaches  to  solving  the  nonlinear  program.  Along 
the  way,  standard  facts  concerning  nonlinear  convex  programs  are  developed  in 
somewhat  greater  generality  than  previously  available. 


SYMMETRIZED  SEPARABLE  CONVEX  PROGRAMMING 


L.  McLinden 


1.  Introduction.  One  of  the  most  useful  model  problems  for 
convex  optimization  is  the  ordinary  convex  programminq  problem,  in  which  a 
convex  function  is  minimized  subject  to  finitely  many  convex  inequality 
constraints.  The  traditional  dual  approach  to  this  problem  involves  the  so- 
called  Lagrangian  duality  model,  in  which  solving  the  dual  problem  amounts 
to  finding  the  optimal  Lagrange  multipliers.  In  1972  E.  L.  Peterson  [7] 
outlined  an  alternate  duality  theory  for  this  problem,  and  various  aspects 
of  this  model  have  been  developed  in  a series  of  related  papers  [8,  9,  10, 

11,  12,  13,  14,  15).  Peterson's  model  provides  a dual  problem  having  more 
variables  than  the  Lagrangian  dual,  but  in  a sense  requiring  no  suboptimiza- 
tion for  the  dual  objective  function  evaluations.  The  model  is  somewhat  more 
general,  too,  in  that  its  primal  problem  has  additional  structure  built  in 
which  acts  to  force  the  dual  to  have  the  same  form.  That  is,  it  is  a 
"symmetric"  duality  model  for  ordinary  convex  programing.  Additive 
separability  plays  a large  role  in  the  model,  at  least  in  a formal  way. 
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In  this  paper  Peterson's  model  is  analyzed  from  the  viewpoint  of 
R.  T.  Rockafellar's  perturbational  duality  theory  (see,  e.g.  [19]  or  [21]). 

The  setting  is  extended  from  r"  to  general  spaces,  the  lower  semicontinuity 
requirements  on  the  functions  Involved  are  relaxed,  and  the  extremality  or 

! 

optimality  conditions  are  sharpened.  Particular  emphasis  is  placed  on  comparing 
I the  "symnetrlzed"  dual  with  the  usual,  Lagrangian  dual,  and  it  Is  shown  in 

r 

a precise  way  that  the  entire  symmetrized  duality  model  projects  onto  the 
!j  traditional  Lagrangian  duality  model.  Enroute  to  this  comparison,  the  standard 

facts  about  the  Lagrangian  model  are  developed  here  in  a setting  slightly  more 
general  than  previously  available  in  the  literature.  The  case  of  homo-  i 

geneous  constraint  functions  is  discussed  as  an  illustration,  with  the  present  ' 

treatment  extending  recent  work  of  C.  R.  Glassey  [2]  on  explicit  duality 
for  such  problems.  As  a technical  byproduct  of  the  present  framework,  a 
new  existence  criterion  is  established  for  optimal  Lagrange  multipliers  in 
the  case  of  r".  Namely,  it  is  shown  essentially  that  it  suffices  to  have 
a Slater  point  for  which  the  polyhedral  constraint  inequalities  are  satisfied 
only  weakly;  this  sharpens  slightly  Rockafellar's  theorem  [19,  Theorem  28.2], 
j which  already  handled  the> important  case  in  which  the  polyhedral  functions  are 

actually  affine. 

The  plan  of  the  paper  is  as  follows.  In  §2  we  establish 
I notation,  state  the  primal,  dual  and  saddlepoint  problems  associated 

I with  the  duality  model  under  study,  and  indicate  some  of  the  technical  Issues 

['  ; 

treated  later  concerning  the  "symmetrized"  dual  and  the  Lagrangian  dual. 

In  §3  we  derive  certain  technical  facts  on  which  the  entire  development  rests. 

I These  concern  formulas  for  the  Fenchel  conjugate  and  "preconjugate"  of  the 

( Indicator  function  of  an  epigraph  set,  together  with  'he  associated  sub- 

I I 

L _ J 


differential  formulas.  The  preconjugate  result  is  somewhat  novel 


technically,  in  that  it  requires  the  reverse  of  the  contnon  proof  technique  of 
dualizing  a result;  it  is  a "predual"  type  of  result.  In  §4  Peterson's 
model  is  derived  in  broad  outline  form  from  a notationally  streamlined 
cone-constrained  model.  Appeal  is  freely  made  here  to  the  various  facts 
concerning  Rockafellar 's  perturbatlonal  duality  approach.  This  viewpoint 
provides  the  basis  for  distinguishing  quite  clearly  in  §3  between  the 
synmetrlzed  duality  model  and  the  Lagranglan  duality  model.  There,  a detailed 
comparison  of  the  two  models  is  made.  The  focus  is  on  a comparison  of 

the  respective  dual  problems,  Kuhn-Tucker  vectors,  and  extremallty  conditions. 

This  section  can  be  viewed  as  extending  somewhat,  to  the  case  of  general  convex 

functions,  certain  Ideas  Introduced  by  Rockafellar  In  1964  {IBJ  and  later 
developed  by  him  for  the  case  In  which  the  functions  involved  are  faithfully 
convex  [20].  In  §6  certain  projection  phenomena  noted  in  §3  are  examined 
further,  and  it  is  shown  the  precise  sense  in  which  the  entire  synmetrlzed 
model  (consisting  of  primal,  dual  and  associated  saddlepoint  problems)  projects 

onto  the  ordinary  duality  model. 

Throughout,  we  use  rather  freely  the  general  definitions  and 

background  material  concerning  convexity  which  is  found  in  [19,  21). 

2.  Notation  and  Statement  of  model.  The  model  we  shall  study 
is  heavily  dependent  upon  additive  separability,  which  may  either  be  already 
available  naturally  or  else  be  Induced  artificially.  In  either  case  it 
is  therefore  necessary  for  us  to  consider,  at  least  in  a formal  way,  a 
number  of  distinct  spaces.  Thus,  let  there  be  given  convex  functions 
f^,  f^,  fj  defined  on  spaces  X^,  X^,  Xj , where  the  indices  i and  j 
range  over  finite  (possibly  empty)  index  sets  I and  J.  The  spaces  may 
in  general  be  any  real  linear  spaces  equipped  with  locally  convex  Hauadorff 
topologies.  In  particular,  they  might  all  be  r”,  or  Hilbert  spaces,  or  reflexive 


Banach  spaces  und'-r  the  norm  topologies.  Let  the  functions  take  values  in 
(-00,  +oo]  without  being  identically  . and  assume  each  function  has 
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lower  semicontlnuoua  hull  soaewhere  finite.  (The  latter  is  fulfilled  auto- 
matically in  the  case  of  r".) 

Throughout,  when  referring  to  product  spaces  we  use  a convenient 
and  transparent  notational  abbreviation.  Namely,  the  product  space^{X^|i  € 1} 
will  be  denoted  simply  by  X^;  its  elements,  the  ordered  |l|-tuples 
^*i^iei  ^ ®®®^  1^1*  will  be  denoted  simply  by  x*. 

Similarly  for  and  x*^,  and  R"^  and  C*.  It  is  convenient  also 

to  let  X denote  the  product  space 


X - X X X X R^  X R-^. 
o 

When  J * 0,  for  example,  as  is  the  case  in  (2.S)  below,  we  agree  to  inter- 
pret and  R*^  as  the  degenerate  vector  space  consisting  of  just  the 

zero  vector.  Such  trivial  "factor"  spaces  can  clearly  be  carried  along  at 
no  cost,  and  are  effectively  suppressed  when  the  general  omdel  is  specialized, 
such  as  in  §§5,  6 for  instance. 

Now  to  each  f^  associate  a function  on  X^  x R defined  by 


0 if  f^(Xj)  + < 0 

+ <»  otherwise 


(2.n 


and  to  each  f^  associate  a function  Oj  on  Xj  x r defined  by 


f (fjCjXXj)  It  tj  >0 

(f  O^Xx  ) It  C - 0 
+ ~ if  < 0 


(2.2) 


To  within  a minus  sign,  is  just  the  indicator  of  the  epigraph  of  f^, 

and  as  such  carries  a complete  description  of  f^.  In  the  definition  of 
Oj,  the  terms  f^d^  are  the  right  scalar  multiples  of  f ^ , 


defined  by 
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The  term  fjO  Is  the  recession  function  of  f ^ , defined  by 

(fjO'*’)(Xj)  - sup{fj(xj  -f  Xj)  - fj(xj)  I xj  € dom  f ^ } . 

It  serves  to  describe  the  growth  properties  or  asymptotic  nature  of  f ^ . 
Clearly  carries  a complete  description  of  f ^ . Less  obvious  is  the 


fact  that  Oj  is  positively  homogeneous  and  convex  in  Jointly. 
The  Oj's  will  be  seen  to  play  a role  dual  to  that  of  the  Flnall) 
define  on  X an  addltlvely  separable  function  f by  the  formula 


f(x)  - 5“^) 


- f 


1 d 


(2.3) 


The  primal  problem  of  the  duality  model  studied  in  this  paper 
can  now  be  stated: 

(P)  min{f (x) |x  € k}, 

where  f is  given  by  (2.3)  and  K is  a convex  cone  of  the  form 


K - P X Q,  P C X X x*^,  Q - {(^\C‘')U^  - 0). 


-I  .Jvl.I 


for  some  given  nonempty  convex  cone  P.  In  other  words,  (P)  is  the 
problem 


(2. A) 


minimize  ^^(x^)  + ^ I 

X «J 

subject  to  (x^,  x^,  x"^)  G P,  “ 0,  , 

where  the  and  are  as  in  (2.1)  and  (2.2).  Essentially  this  problem 

was  Introduced  by  Peterson  [7]  In  1972,  and  various  technical  aspects  of  it 
are  treated  in  [8-15} . 

Notice  that  the  only  way  In  which  the  variables  in  (P)  can  be 


coupled,  or  made  dependent  on  one  another,  is  by  means  of  the  cone  P. 

This  high  degree  of  formal  additive  separability  makes  possible  some 
technical  simplifications  in  deriving  certain  required  fotmulas  and  also  in 
carrying  out  some  of  the  proofs.  To  Illustrate  how  coupling  of  variables  can  be 
achieved,  consider  what  is  probably  the  most  important  case,  that  in  which 

J-0  and  P-{(x,x^)|x  -x,,\/i€l}.  (2.5) 

o . o 1 

It  is  Immediate  from  (2.3),  (2.1)  and  (2.4)  that  (P)  is  then  Just  the 
ordinary  convex  program 

(P^)  mln{f^(x^) |f^(x^)  £ 0,  V 1 € I), 

provided  we  Ignore  the  presence  of  the  trivial  variables  which  must  be 

zero,  and  also  the  fact  that  there  are  really  1 -f  |l|  copies  of  the 
variable  x^.  Other  special  cases  of  problem  (P)  are  described  in  [8]. 

The  problem  dual  to  (P)  in  the  symmetrized  model  involves  the 
Fenchel  conjugate  of  f, 

f (y)  - sup{<x,  y>  - f(x)}. 

Due  to  additive  separability,  cf.  (2.3),  this  can  be  expressed  in  terms  of 
the  conjugates  of  the  individual  terms  f^,  f^,  f^  making  up  f: 

c*,  S c*i  I J I J. 

f (y)  * f (y^t  y , y . n , n ) 

I J 

Here  the  variables  y,  y^,  y^,  y^,  etc.,  range  over  the  spaces 
Y,  Y^,  Y^,  Y^,  R^,  etc.,  which  are  dual  to  the  original  spaces  (or  more 
generally,  which  are  paired  in  duality  with  them  [1,  3]).  The  value  of  a 
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y 


continuous  linear  functional  y ^ Y at  a vector  x € X is  denoted  by 
<x,  y>,  etc.  In  particular,  in  the  case  of  R*'  this  bracket  notation 
simply  means  the  usual  dot  product  of  two  vectors. 

The  problem  dual  to  (P)  can  now  be  stated: 

(D)  -min{f  (y) [y  € K }, 

* 

where  K is  the  cone  dual  to  K (l.e.  the  negative  polar  of  K) . According 
to  (2.4),  we  have 

K*  - p*  X Q*.  Q*  - - 0),  (2.7) 


and  so  by  (2.6)  this  problem  is  therefore  of  the  form 

-minimize  f^  (y^)  + I ^ ^ °J 

X J 

subject  to  (y^,  y^,  y*^)  € p , € R^,  ■ 0. 


It  is  not  fully  apparent  from  this  that  (D)  really  has  the  same  form  as  (P). 
It  follows,  however,  from  the  facts  that  the  functions  conjugate  to  and 


Cj  are  given  by 


' <'i 

*nj^)(yi)  if 

> 0 

* 

</'i  (y^.Hj:)  * ■ 

V)(y^)  if 

* 0 

(2.8) 

+ 00 

if 

< 0 

and 

•k 

0,  (y.,n.)  « ■ 

J J 1 

( 0 

1 

If  f;(yj)^ 

njiO 

(2.9) 

[ + 00 

otherwise 

respectively. 

This  shows  that 

(D) 

is  formed  from 

convex  functions 

* * 

'o  ’ ^ ’ fj 

* 

together  with  finite  index  sets  I,  J and  convex  cones  P , Q in 


I 


essentially  the  same  manner  as  was  (P) , by  Just  interchanging  the  roles 
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of  I and  J. 

For  the  important  case  (2.3).  in  which  (P)  becomes  essentially 
the  ordinary  convex  program  (P  ) , we  have 


y^)!yo  + I “ o>- 


(2.10) 


This  means  Chat  (D)  assumes  Che  form 


max{-f^*(y^)  - I (f  ^*0^)  (y^)  ly^,  + I y^  “ 0 and  n^iO),  (2.11) 


provided  the  terms  are  interpreted  as  0 whenever  ■ 0. 

We  shall  present  in  §5  a mild  condition  under  which  this  problem  can  be 


simplified  significantly,  namely  by  interpreting  the  terms  f^  <1^  as 


whenever  * 0.  On  Che  other  hand,  the  dual  problem  associated  to  (P^) 
by  the  ordinary,  l.e.  Lagranglan,  model  is 


(D  ) maxiinf  {f  (x  ) + [ H.f. (x  ))}, 

o - oo  ^ilo 

n >0  X ec  ^ 

— o 


where  C * dom  f nO<lom  f . . It  will  be  shown,  among  other  things,  that  Che 
o 1 1 


optimal  values  in  problems  (D)  and  (D^)  always  satisfy 


val  (D)  < val  (D  ) , 
— o 


where  (D)  here  is  understood  to  be  the  problem  (2.11).  Under  relatively 


weak  assumptions  which  vary  slightly  with  the  situation  being  treated,  this 


Inequality  is  actually  an  equality,  and  furthermore  a vector  n solves 


(D^)  if  and  only  if  there  exist  vectors  y^,  y^  such  that  (y^,  y^,  n^) 


solves  (D) , l.e.  (2.11).  In  other  words,  under  mild  conditions  Che  ordinary 


dual  (D^)  is  Che  image  of  the  "symmetrized"  dual  (2.11)  under  the  projection 


transformation  (y^,  y^,  n^)  0^.  Peterson  [14]  has  also  observed  essentially 


■ li 


p 


this  connection  between  the  two  duals.  The  connection  was  also  Indicated 
earlier  by  Rockafellar  rather  implicitly  in  [19,  top  p.  322]  and  explicitly 
in  [20],  where  it  forms  a key  element  in  the  computational  approach  suggested 


there. 


In  196A  Rockafellar  [18,  p.  88,  esp.  Theorem  5]  established  a strong 


duality  theorem  relating  (P^)  to  problem  (2.11).  He  invoked  a Slater-type 

constraint  qualification  and  worked  with  constraint  functions  which  were 

continuous  and  everywhere  finite;  a refinement  to  cover  affine  constraints 

was  included.  Also,  in  [19,  pp.  322-323]  and  [20]  Rockafellar  generated  essentially 

problem  (2.11)  as  a dual  to  (P^)  via  the  general  perturbational  duality 

theory.  This  Involved  equipping  (P^)  with  a certain  rather  full  class  of 

perturbations.  In  the  development  below  we  exploit  the  perturbational 

duality  approach  fully  in  deriving  the  symmetrized  model.  By  this  means  it 

will  be  clear  that  the  present,  syinnetrized  model  steins  from  the  same,  full 

class  of  perturbations,  but  with  the  added  elements  of  separability  and 

symmetry  built  in  as  well. 

Underpinning  the  entire  symmetrized  model  are  the  conjugacy  and 
subdifferential  fomulas  for  the  functions  and  defined  in  (2.1)  and 

(2.2).  The  conjugacy  formulas,  (2.8)  and  (2.9),  have  already  been  invoked 
in  stating  the  dual  problem,  and  the  subdifferential  formulas  will  be  es- 
sential in  developing  the  model's  extremality  conditions.  We  turn  now  to 
the  task  of  deriving  the  specific  facts  which  will  be  needed. 

3.  The  indicator  of  an  epigraph;  its  preconjugate  and  conjugate. 

To  a large  extent,  the  material  in  this  section  builds  on  facts  and  techniques 
concerning  recession  vectors  contained  in  [17,  19].  Proposition  2 
is  somewhat  novel  in  that  it  amounts  to  a "pre-dual"  version  of 
Proposition  1,  as  opposed  to  the  usual  dual  type  of  result.  The  reader  can 
skip  over  the  proofs  in  this  section  on  first  reading. 
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LEMMA  1.  If  f is  a proper  convex  function  with  Isc  f somewhere 

A A -f 

finite,  then  \l),  , “ f 0 • 

dom  f 

PROOF.  For  f actually  closed,  this  was  established  in  [17, 
Corollary  3C(d)],  while  for  f not  necessarily  closed  but  the  underlying 
space  Euclidean,  the  result  was  established  in  [19,  Theorem  13.3].  The 
latter  proof  really  carries  over  to  the  general  situation,  as  we  now 

A 

demonstrate.  Since  f Is  by  definition  the  pointwise  supremum  of  the  family 

of  continuous  affine  functions  of  the  form  h(y)  >•  <x,y>  - where 

A 

(x,C)  ^ epi  f,  it  follows  that  epi  f is  the  (nonempty)  intersection  of 

A >4-  4>  A 

the  corresponding  closed  halfspaces  epi  h.  Hence,  epl(f  0 ) > 0 (epi  f ) 
is  the  recession  cone  of  this  intersection.  By  [17,  Theorem  2A(b)l,  it  follows 
that 

0^  n {epihj  (x,C)  ^ epi  f}  = n {O^  epi  h |(x,^)  G epi  f). 

Now  for  h(y)  = <x,y>  - ^ it  is  clear  that 

0^  epi  h * epi  hO^  * epi  <x,*> 

Combining  these  facts,  we  obtain 

epi(f  0^)  = <''{epi  <x,*>|(x,^)  G epi  f} 

= epi(sup{<x, •> I X G dnm  f}) 

= f>’ 

which  completes  the  proof. 

We  now  deal  with  the  functions  of  type  given  in  (2.1).  Our 

first  result  establishes  formula  (2.8)  and  more. 


PROPOSITION  1.  Let  f be  a proper  convex  function  with  Isc  f 
somewhere  finite,  and  write  E • {(x,5)lf(x)  + ^ ^ O} . Then  cl  ij'g  is  the 


indicator  function  of  {(x,C)|(cl  f)(x)  + ^£0},  and 


(f  n)(y) 

if 

n 

> 0 

'J'g(y,n)  ■ ' 

(fV)(y) 

if 

n 

- 0 

+ 00 

if 

n 

< 0. 

Hence,  one  has  the  inequality 

* 

(f  n)  (y)  £ <x,y>  Cn  whenever  f(x)  C £ 0 £ n, 

* * + 

where  f n is  interpreted  as  f 0 when  n = 0.  Moreover,  equality  occurs  in  this 
inequality  if  and  only  if  (y,n)  ^ Dij;g(x,C).  and  this  is  equivalent  to  the 
conditions 

y G D(nf)(x),  f(x)  + ^ £ 0 £ n,  (f(x)  + C)*n  - o, 

where  nf  is  Interpreted  as  £ when  n ■ 0.  (Note:  3(nf)(x)  - n3f(x) 

when  n > 0.) 

PROOF.  First,  cli/;  * <p  . Since  cl(epl  f)  * epi(cl  f),  it 

E cifc 

follows  that  cl  E has  the  form  asserted.  Next,  we  compute  <^e(y,n).  If 
n < 0,  then  using  any  (x,C)  ^ E we  obtain 

* , — 

</'p(y,n)  £ su£{<x,y>  + 5n}  - + ». 

C£C 

If  n = 0,  we  have 

ipg(y,0)  - sup{<x,y>|  (x,C)  ^ E} 

- sup{<x,y>|x  G dom  f} 

- (fV)(y), 


where  the  last  equality  is  by  Lemma  1.  If  n > 0,  we  have 
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4'g(y.n)  “ 8up{<x,y>  + Cn|f(x)  + f;  1 0} 

••  sup{<x,y>  - nf(x)} 

X 

■ n*sup{<x,n  ^y>  - f(x)} 

X 

* (f  n)(y). 

This  establishes  the  conjugacy  formula.  Using  It,  Fenchel's  inequality 
ii;g(x,C)  + '^*(y,n)  > <x,y>  + 

for  the  function  reduces  to  the  inequality  asserted.  Moreover,  we  know 

that  the  case  of  equality  is  characterized  by  the  condition  (y,n)  ^ DiJ»_(x,C) 

L 

A 

on  the  variables  involved.  By  the  particular  form  of  ip„  and  , this 

£ £ 

is  equivalent  to  having  either 

(a)  f(x)  + 5 £ 0 < n and  (f  n)(y)  “ <x,y>  + 

or  else 

(b)  f(x)  + C £ 0 = n and  (f  0^)(y)  ■ <x,y>. 

Now  we  claim  that  (a)  is  equivalent  to 

(a')  f(x)  + C = 0 < n and  y € 3(nf)(x), 
and  (b)  is  equivalent  to 

(b')  f(x)  + C 1 0 = n and  y e ^(x). 

* * 

Indeed,  suppose  (a)  holds.  Since  n > 0,  the  easy  identity  (nl)  = I H holds. 
From  this  and  the  definition  of  (Tlf)*(y)  follows 

<x,y>  - nf(x)  £ (f  n)(y). 

Combined  with  the  rest  of  the  information  in (a) , this  yields 

f(x)  + C » 0. 


Now  using  this,  we  can  rewrite  the  equation  in  (a)  as 


1 
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nf(x)  + (f*n)(y)  ■ <x,y>, 

which,  in  view  of  the  identity  cited  earlier,  is  equivalent  to  y G D(nf)(x). 
The  converse  implication,  starting  from  (a'),  follows  by  reversing  the  last 
part  of  the  argument  Just  given.  Now  suppose  (b)  holds.  Then  x G dom  f,  so 
that 

(j'doni 


Since  tl),  . » f 0^  by  Lemma  1,  this  means  y G ,(x).  This  argument, 

dom  t dom  t 

too,  reverses  to  establish  the  converse.  To  complete  the  proof,  just  observe 
that  the  disjunction  (a')  or  (b')  is  equivalent  to  the  characterization  we 
needed  to  prove. 

We  turn  now  to  functions  of  type  , given  by  (2.2).  When  f^ 
is  closed,  the  basic  information  needed  later  for  the  symmetrized  model  can 
be  obtained  by  "dualizing"  Rroposition  1 in  the  usual  manner  (i.e.  by  applying 

A A 

it  to  f in  place  of  f and  using  f = cl  f = f ) . In  order  to 

cover  which  are  not  necessarily  closed,  however,  we  now  derive  another, 

more  general  result,  one  somewhat  parallel  to  Propostion  1.  It  covers  the 
application  to  the  arising  from  nonclosed  fj's  as  in  (2.2)  and  also  a 

bit  more.  It  may  be  useful  in  applying  the  symmetrized  model  to  situations 
in  which  the  recession  functions  fjO  are  not  readily  available. 

PROPOSITION  2.  Let  f be  a proper  convex  function  with  Isc  f 
somewhere  finite,  and  let  h be  any  function  (not  necessarily  convex) 
satisfying  (cl  f)0^  £ h £ choices  h » and  h *=  fO^  are 

the  main  ones.)  Then  the  (not  necessarily  convex)  function  a defined  by 


0(x,0 


(fO(x)  if  C > 0 

h(x)  if  C - 0 

+ «>  if  C < 0 
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satisfies 


a « \Ij  , where  E = {(y,n)|f  (y)  + H £ 0} 


and  in  fact 


(Isc  o)(x,C)  = tj<g(x,C)  * i 


((cl  f)5)(x)  If  C > 0 

((cl  f)0'‘’)(x)  if  ? - 0 

+ » if  C < 0 , 


where  Isc  o is  closed  proper  convex.  Hence,  one  has  the  Inequality 

* 

(fC)  (x)  ^ <x,y>  + Cn  whenever  f (y)  + H £ 0 £ ^ , 

where  fC  is  interpreted  as  h when  C - 0.  Equality  occurs  in 
this  inequality  if  and  only  if  (y,n)  ^ i>o(x,^),  and  this  is  equivalent  to 
the  conditions 

y e 3(f^)(x),  f (y)  + n 1 0 _<  5 . (f  (y)  + n)*C  • 0, 

where  again  f^  is  Interpreted  as  h when  ^ = 0.  Moreover,  the  function  I 

o is  convex  if  and  only  if  h is  convex  and  satisfies  ^ ^ ^ fO^.  i 

■f  j 

(Thus,  the  choice  h = fO  is  the  least  which  will  make  a convex.)  Finally,  | 

in  the  event  that  X = r",  the  function  a is  polyhedral  convex  if  and  | 

+ I 

only  if  f is  polyhedral  convex  and  h » fO  . j 

PROOF.  First  we  show  why  h can  be  taken  to  be  fO^,  i.e.  why  ] 

(cl  f)0^  £ fO^.  This  is  equivalent  to  epi  fO^  C epi  (cl  f)0^.  Since  ' 

i 

epi  fO^  = 0^  epi  f,  epi (cl  f)0^  “ 0^  epi  cl  f = 0^  cl  epi  f,  . ! 

the  question  reduces  to  whether  0'*'s  C o'*'(cl  S)  holds  for  a nonempty  convex 


set  S.  But  this  follows  from  [17,  Theorem  2A(b) ] (or  [19,  Theorem  8.3]  in 
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the  Euclidean  case).  Next,  we  work  towards  the  formulas  for  Isc  o and 

A 

o . To  this  end,  let  denote  the  "sigma*'  function  corresponding  to 

the  choice  h = let  o correspond  to  an  arbitrary  h of  the  sort 

specified  in  the  hypotheses,  and  finally  let  O2  denote  the  asserted  form 
of  Isc  a.  We  have  02^0^0^^  by  these  definitions  (since  cl  f £ f), 
so  that 


* * * 

Isc  £ Isc  a £ Isc  £0  £ O2  ‘ 


From  Proposition  1 we  have 


- ® < Isc  02  = 02  = , O2  = 


for  E = {(y,n)|f  (y)  + n £ O}.  We  also  have  that 


<^2  “ sup{<x,y>  + - o^(x,C)} 


where 


so  that 


max  {0,a}, 


a = sup  {<x,y>  + - (fC)(x)} 

C>0,x 

= sup  sup  {<x,y>  + Cn  - ?f(C  ^x)} 
C>0  X 

= sup  {?• (supf<x,y>  - f(x))  + n)} 

C>0  X 

= sup  (C* (f  (y)  + n) 

C>o 

= 'i'gCy.n), 


♦e  ■ ”2  • 


Since  Isc  O^^  > - <»,  it  follows  that 


•kit 


Isc  0.  « cl  a,  - 0,  ** 

1 1 1 E 2 
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In  view  of  the  above  Inequalities  bounding  Isc  o and  o , it  follows  chat 


Isc  o " ij/g  , 


o - 


Next,  using  the  formula  a « tp  , we  can  write  Fenchel's  inequality  for 

E 

the  function  o as 


o(x,5)  + >J;_(y,n)  > <x,y>  + Cn. 


This  is  the  general  inequality  asserted.  Equality  occurs  in  it  if  and  only  if 
(y»n)  ^ 3a(x, 5)-  Taking  into  account  the  special  form  of  o and  ip  , this 

Ct 

is  equivalent  to  having  either 


or  else 


(a)  f (y)  + n 1 0 < £;  and  (fC)(x)  - <x,y>  + Cn 


(b)  f (y)  + n £ 0 » 5 and  h(x)  * <x,y  > 


By  an  argument  similar  to  that  used  in  Proposition  1,  one  can  verify  chat  (a) 
is  equivalent  to 

(a')  f (y)  + n = 0 < 5 and  y € 3(f^)(x). 

4t  A .f- 

(One  uses  the  identity  (fC)  • ^f  . ) Now  consider  (b).  From  (cl  f)0  £ h £ 

it  follows  that 

" <0}  1 ^ = ^cKdom  f*)  * 

★ * 

where  the  last  equality  is  by  Lemma  1 applied  to  f . Hence,  h is 

A 

identically  zero  on  cKdorn  f ).  From  this  fact  and  the  observation  that 

A A 

f (y)  + Ti  £ 0 forces  y € dom  f , it  follows  that  (b)  is  the  same  as 
(b')  f(y)+r|£0“C  and  y e Dh(x). 


and  y S Dh(x) . 
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Hence,  (y,n)  ^ Dcj(x,?)  Is  equivalent  to  having  either  (a')  or  (b*) 
hold,  which  Is  essentially  Che  assertion. 

Now  we  Cackle  Che  convexity  cnaracCerlzaClon.  Since  0 Is 
convex  on  X x R if  and  only  if  its  restriction  to  each  line  in  X x R 
is  convex,  let  us  examine  the  behavior  of  the  one-dimensional  function 

Oj^(X)  - a(KyK^),  - ® < X < + » 
on  lines  L consisting  of  points  of  the  form 

(x^,Cj^)  - (1-X)(Xq,5q)  + X(xj^,?j^) 

for  distinct  pairs  in  X x r.  Each  such  L can 

be  regarded  as  being  of  one  of  three  types:  (i)  * ^1  " ^0  ” ^1 

and  (ill)  Cq  * 0 and  “ 1*  The  convexity  of  for  all  lines 

L of  type  (i)  is  clearly  equivalent  to  convexity  of  h on  X.  Next,  consider 
the  function 


Ol(x,C) 


(f?)(x) 

if 

K > 0 

'1^(0}  (X) 

if 

5 = 0 

+ oo 

if 

5 < 0 

which  is  convex  because  f is  (see  [19,  page  35]).  Since  O (X)  for  lines 
L of  type  (ii)  coincides  with  the  restriction  of  a^(x,C)  to  L,  it 
follows  tbit  convexity  of  o (X)  is  automatic  on  all  lines  of  type  (ii). 

Li 

Now  consider  any  fixed  line  L of  type  (iii);  that  is,  let  pairs  (x^.O) 
and  (x  ,1)  be  given  (notice  that  = X in  this  case).  Now  0 (X) 

X A Li 

coincides  with  the  restriction  of  aj^(x,C)  to  L everywhere,  except 
perhaps  at  the  point  (Xq,0)  corresponding  to  X = 0.  Since  for  such  an  L 
the  function  o (X)  has  the  value  + “>  for  negative  X and  is  known  to 

la 
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be  convex  for  positive  A,  it  will  be  convex  on  all  of  R if  and  only  if 

either  a (0)  ^ lim  a (A)  or  o (A)  ■ + “>,V  A > 0. 

^ A4-0  ^ ^ 

Now  the  first  of  these  alternatives  is  the  same  as 

h(x  ) > lim  (fA)(x,), 

° " A+0  ^ 

while  the  second  can  easily  be  seen  equivalent  to 

X,  ? dom  f + (-«>,l)x  . 

1 o 

It  follows  that  convex  on  all  lines  L of  type  (ill?  having  the 

same  x^  parameter  if  and  only  if 


h(x  ) > sup  {lim  (fA)(x,  )lx-  € dom  f + (-«>,l)x  }. 
° " A+0  ^ ^ ° 

Let  denote  this  supremum.  We  can  rewrite  it  as 


1!  s(x  ) = sup  sup  {lim  (fA)(x,)|x-  “ x + x_  - Bx^} 

1 ° x€dom  f B>0  A+0 

I = sup  {t(x^,x)|xG  dom  f}, 

i, 

% where  for  each  x € dom  f 


t(x  ,x)  = sup  {lim  Af(x  + (A  ^-3)x  )} 

° B>0  A+0  ° 

= sup  {lim  A ^ f(x  + (A-B)x  )} 
B>0  A+<»  ° 

= sup  {lim  (3+t)  ^f(xfTx  )}. 
8>0  T+<»  ° 

Now  it  is  not  hard  to  show  that 


lim  (B+t)  ^f(»fTx  ) - lim  T ^f(xfTx  ) 

xfoo  ° xf® 

for  each  B > 0,  x e dom  f,  x^  € x.  (To  see  this,  let  0(t)  - f(x+Tx^)  and 
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lim  T ^0(t).  If  y is  finite, 

Xfoo 

lim  (6+T)"^e(T)  - lim  [(B+t)"\]*[t"^0(t)] 


Xfoo 


Tfoo 

lim  [ (6+t)”^t] ‘lim  [t”^0(t)] 


Tfoo 

1 • Y 


Tfoo 


If  Y is  then  (B+t)"^0(t)  < t”^0(t)  implies 


lim  (6+t)  ^0(t)  £ y = - 

Tfoo 

If  Y is  +“»  let  M > 0 be  given.  We  can  choose  T £ 6 so  that 
T ^0(t)  £ 2M  whenever  t £ T . Since  (6+t)  £ 1/2  for  x £ T , 

it  follows  that 


(3+T)‘^e(T)  - [(6+t)“^t]*It‘^0(t)]  > (1/2)  •(2M)  = M 


whenever  T £ T.  This  shows  lim  (6+t)  0(t)  = +oo  also,  completing  the 

Tfoo 

verification.)  Using  this  fact,  we  have  that 

t(x  ,x)  = lim  T ^f(x+Tx  ), 

° Tfoo  O 

and  hence 

s(x  ) = sup  {lim  T ^f (x+Tx  ) [x  G dom  f} 

® Tf®°  ^ 

= sup  (lim  T ^[f(x+Tx  )-f(x)](x  S dom  f} 

Tfoo  ° 

= sup{  sup  (t  ^[f(x+Tx  )-f(x)]}j  x€  dom  f} 

0<T<“  ° 

= sup  {f(xfx^)  - f(x)|x  € dom  f} 

» (fO'*’)(x^). 

It  follows  that  o (X)  is  convex  on  all  lines  L of  type  (iii)  if  and 

u 
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only  if 

h(x  ) > f0^(x  ),  V X € X. 
o - o o 

Combining  this  fact  with  the  earlier  ones  concerning  lines  of  type  (1) 
and  (ii)  completes  the  proof  of  the  convexity  characterization. 

Finally,  assume  X ■*  r"  and  suppose  that  0 is  polyhedral  convex. 
Since  o is  clearly  proper,  a is  therefore  closed  [19,  Corollary  19.1.2] 
and  agrees  with  Isc  o.  From  the  formula  already  established  for  Isc  o, 
it  follows  that 

fC  * (cl  f)C  for  each  ^ > 0 and  h * (cl  f)0^. 

The  choice  C * 1 In  the  first  fact  yields  f closed,  and  so  the  second 
fact  yields  h » fO^.  The  fact  that  f is  polyhedral  is  immediate  from 
the  fact  that  epl  f is  essentially  just  the  intersection  of  epl  a with 
the  hyperplane 


{(x,C,u)  e r"  X R X Rj^.  1}, 

and  hence  is  a (nonempty)  polyhedral  convex  set.  Suppose  conversely,  now, 

that  f is  polyhedral  proper  convex  and  h = fO^.  Then  f is  closed 

[19,  Corollary  19.1.2],  so  the  formula  for  Isc  o shows  that  Isc  o, 

** 

hence  o is  closed  proper  convex.  Therefore  o = o , where  we  know  that 


* 

0 (y,n) 


0 if  f (y)  + n £ 0 
+ “ otherwise  . 


This  shows  O is,  to  within  a minus  sign,  the  indicator  of  epi  f . Since 

* * * 
f is  polyhedral  [19,  Theorem  19.2],  this  means  O is,  and  hence  o » (a  ) 


is  also  [19,  Theorem  19.2]. 
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We  remark  that  Proposition  1 for  the  closed  case  can  be  deduced 
from  Proposition  2 by  choosing  h » fO^  and  then  dualizing  the  resulting 
facts . 

A.  The  symmetrized  separable  duality  model.  Here  we  derive  the 
overall  structure  and  prove  the  basic  facts  concerning  the  general  synmetrized 
model.  That  is,  starting  from  the  primal  problem  (P)  given  in  §2,  we 
generate  (D)  and  an  associated  saddlepolnt  problem.  Then  we  develop  the 
associated  Kuhn-Tucker  theorem,  and  provide  several  criteria  for  strong 
duality.  Including  an  especially  sharp  criterion  for  the  finite-dimensional 
case. 

Our  approach,  as  implied  in  §2,  is  to  think  of  (P)  initially 
as  simply  a problem  of  minimizing  a convex  function  f over  a convex 
cone  K,  suppressing  for  the  moment  the  particular  structure  of  f and  K. 
Symmetric  duality  for  this  very  general  form  of  the  problem  dates  back  to 
1963  (Rockafellar  [16],  see  also  [19,  Theorem  31. A]).  Although  one  can,  of 
course,  regard  this  as  a special  case  of  the  original  Fenchel  duality 
theorem,  for  what  follows  we  prefer  to  present  it  Instead  within  the  broader 
perspective  of  perturbational  duality.  In  doing  this,  we  follow  the  general 
formulation  given  in  [21].  This  has  the  advantage  that  many  of  the  general 
results  from  [21]  can  immediately,  or  at  least  fairly  readily,  be  translated 
into  the  present  situation.  For  this  reason,  in  the  development  below  we 
limit  ourselves  to  writing  down  only  the  most  central  results  for  the  present 
model,  leaving  to  the  Interested  reader  the  task  of  stating  the  many  other, 
more  special  results  which  can  be  so  obtained. 


Let  Y denote  the  space  paired  with  X,  and  let  V be  another 


copy  of  Y 

and 

U 

be  another  copy  of 

X. 

From  now  on,  we  regard  X as 

paired  with 

V 

and 

U as  paired  with 

Y. 

Now  consider  the  convex  function 

1 


1 

1 


) 

i 


i 


F on  X X U given  by 


F(x,u)  - f(x)  + ij/j^(x+u). 


(4.1) 


and  the  associated  minimization  problem 


min  {F(x,  0)}  , l.e.  min{f(x)}, 

X 3^K 


(4.2) 


From  F form  a concave  function  G on  Y x V by  means  of  the  formula 


G(y,v)  = inf  {F(x,u)  + <u,y>  - <x,v>} 


[21,  eq.  (4.17)].  In  view  of  (4.1),  a straightforward  computation  yields  that 


where 


G(y,v)  = -<|'j^*(y)  - f (y+v). 


K “ {v|  <x,v>  ^0,  \/  X € K } 


(4.3) 


The  problem  dual  to  (4.2)  is  now  formed  using  G: 


max  {G(y,0)},  i.e.  -min  {f  (y)}. 
y y€K* 


(4.4) 


The  "generalized  Lagranglan"  saddlepoint  problem  associated  with  (4.2)  and 
(4.4)  is  that  which  corresponds  to  the  saddle  function  H on  X ^ Y given 
by  the  formula 


H(x,y)  = inf  {F(x,u)  + <u,y>} 


[21,  eq.  (4.2)].  Another  easy  calculation  using  (4.1)  yields  that 


H(x,y) 


f f(x)  - i['„*(y)  - <x,y>  if  x e dom  f 


(4.5) 


otherwise 
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The  associated  Lagrangian  problem  is  thus 

(L)  mlnimax  {f(x)  - <x,y>}.  (4.6) 

o^dom  f y €K* 

Notice  that  this  problem  is  essentially  linear  in  y (disregarding  the  relatively 

■k 

simple  cone  constraint  K ).  The  abstract  Kuhn-Tucker  condition  (cf.  (19,21]), 
i.e.  extremality  or  optimality  tfi'irion,  corresponding  to  problems  (4.2), 

(4.4)  and  (4.6)  is 


(0,0)  e 3H(x,y). 

This  is  equivalent  to  the  property  that  the  pair  (x,y)  is  a saddlepoint  of 
H,  i.e.  a solution  of  (L) , and  this  is  easily  seen  to  be  equivalent,  in 
view  of  (4.5),  to  the  pair  of  subdifferential  relations 


y e 3f(x),  -X  e 3i^j^*(y). 


(4.7) 


The  symmetrized  model  now  essentially  follows  from  the  above, 
upon  choosing  f and  K as  in  (2.3)  and  (2.4)  and  invoking  the  information 
contained  in  Propositions  1 and  2.  In  view  of  the  very  general  formulation 
of  Proposition  2,  though,  we  can  actually  treat  with  no  additional  effort 
a form  of  (P)  slightly  more  general.  Specifically,  we  assume  henceforth 
that  f is  defined  as  in  (2.3),  but  with  the  slight  change  that,  instead 
of  taking  the  form  (2.2),  we  permit  them  to  be  of  the 

form 


aj(Xj.5j) 


(fj5j)(xj)  If  5j>0 
• hj(xj)  If  5j.o 

+ 00  if  < 0, 


(4.8) 


where  h is  any  given  extended-real-valued  convex  function  satisfying 
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"'(0)^  '■j  - fj"'"  • 

With  this  choice  of  f and  K,  problem  (4.2)  becomes  the  primal  problem 
(P)  introduced  following  (2.3),  where  it  is  understood  that  the 
may  be  of  the  more  general  form  (4.8),  (4.9). 

Turning  now  to  the  dual,  in  problem  (4.4)  we  have  K given 
by  (2.7).  By  Propositions  1 and  2,  formulas  (2.8)  and  (2.9)  are  valid  for 

it  :k  it 


and  Oj  , respectively.  When  substituted  into  formula  (2.6)  for  f , 

it 

these  serve  to  express  f directly  in  terms  of  the  individual  conjugates 

it  it  it  it  it 


f^  , f^  , fj  . The  resulting  expression  for  f reveals  f as  having 

structure  essentially  Identical  (i.e.  "symmetric")  with  that  of  f,  except 

* 


(4.9) 


for  the  roles  of  i and  j having  been  Interchanged.  Note  that  in  Q 
also  the  roles  of  1 and  j are  Interchanged.  This  shows  that  problem 
(4.4)  does  reduce  to  problem  (0)  of  §2,  and  Chat  this  problem  is  indeed  a 
symmetric  dual  of  (P). 

The  reader  can  easily  imagine  the  saddlepoint  problem  (4.6) 
written  also  in  terms  of  the  additional  structure  now  present. 

Consider  now  the  perturbational  aspect  of  the  model.  Formula  (4.1) 
shows  that  problem  (4.2)  is  embedded  in  a whole  class  of  parametrized  problems 


u >•  min  { F(x,u)}  , 

X 

in  which  the  perturbation  parameter  u measures  the  amount  by  which  the 
cone  K and  the  graph  of  f have  been  shifted  horizontally  in  relation  to 
each  other.  An  examination  of  (4.3)  and  (4.4)  reveals  the  (symmetric) 
fact  that  the  same  type  of  perturbations  are  involved  with  the  dual  problem. 
It  is  an  important  fact  about  perturbational  duality  that,  when  there  is  no 
duality  gap,  Che  dual  optimal  solutions  serve  to  describe,  in  terms 
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of  directional  derivatives  and  subf'.rnd  i <Mit  theory,  the  sensitivity  or 
instantaneous  rate  of  change  of  the  primal  optimal  value  with  respect  to 
small  perturbations  in  a given  direction  u,  say.  (See  (19,  Theorem  29.1] 
and  [21,  Theorems  16  and  17].)  Viewed  in  relation  to  the  specific  structure 
involved  in  (P),  the  above  class  of  perturbations  for  (4.2)  are  as  follows. 
Regarding  the  cone  P as  stationary  (in  relative  terms),  the  epigraph  of 
f^  is  translated  horizontally,  the  epigraphs  of  the  are  each 

translated  both  horizontally  and  vertically,  and  the  epigraphs  of  the 
are  each  both  translated  horizontally  and  also  subjected  to  a positive  scalar 
magnification  with  respect  to  the  origin.  A dual  optimal  solution  will, 
according  to  the  general  theory,  usually  describe  primal  problem  sensitivity 
with  respect  to  any  vector  combination  of  the  above  types  of  problem 
perturbations. 

A dual  optimal  solution  y which  satisfies  the  condition 

val(P)  = val(D)  = -f  (y)  G p,  y G k , (4.10) 

is  called  a Kuhn-Tucker  vector  for  (P) . The  importance  of  such  vectors  stems 
from  the  following  type  of  result. 

PROPOSITION  3.  Let  y be  a Kuhn-Tucker  vector  for  (P) . Then 
the  solutions  of  (P),  if  any,  occur  among  the  global  minimizers  of  the 
function 

X H(x,y)  = f(x)  - <x,y>  . 

In  particular,  x solves  (P)  if  and  only  if 


and 


f(x)  - <x,y>  ■ inf{f  - <•  , y>} 
X 


X G K,  <x,y>  ■ 0 . 


PROOF.  From  the  way  G and  H were  introduced  above,  we  have  that 


G(y,v)  = inf  {inf  {F(x,u)  + <u,y>}  - <x,v>} 


inf  {H(x,y)  - <x,v>} 


For  any  y G K , it  therefore  follows  from  (4.3)  and  (4.5)  that 


-f  (y)  » inf  {H(x,y)}  = inf  { f - <*,  y>}  . 

X X 


(4.11) 


Now  assume  y Is  a Kuhn-Tucker  vector  for  (P),  and  write  y for  the  common 
optimal  value  in  (4.10).  Suppose  x solves  (P) . Then 


f(x)  = y , X e K 


But  by  (4.10)  and  (4.11)  we  also  have 


* * 
y = -f  (y)  < f(x)  - <x,y>  , y e K 


Using  y G R,  we  can  combine  these  facts  to  deduce  both  0 £ - <x,y>  and 
0 <x,y>  . Hence  0 = <x,y>,  and  so  it  follows  (using  (4.11)  again)  that 


f(x)  - <x,y>  = y = inf{f  - <•  ,y  >}  . 

X 

On  the  other  hand,  suppose  that  x satisfies 


X G K,  <x,y>  = 0,  f(x)  - <x,y>  = inf  {f  - <*,y>} 

X 

Using  (4.10)  and  (4.11),  we  can  then  write 


y = -f  (y)  = f(x)  - <x,y>  , 


and  hence  y ■ f(x).  Since  also  x G k,  this  shows  that  x solves  (P). 

The  next  result  includes  an  explicit  characterization  of  the 


Kuhn-Tucker  conditions. 


27 


PROPOSITION  4.  In  order  that  x solve  (P)  and  y be  a 
Kuhn-Tucker  vector  for  (P),  it  is  necessary  that  (x,y)  solve  (L) , and 
this  occurs  if  and  only  if  (x,y)  satisfies  the  Kuhn-Tucker  conditions 


(x^.x^.x*^)  e cl  P,  (y^.y^.y*^)  e p , <(x^,x^,x'^) , (y^,y^,y'^)>  = 0, 


^o  ^ ^^o^V’ 


fi(Xi)  1 0 1 


(yj)loiCj 


= 0 


^i  ^ 3(n^f^)(x^) 


V i e I, 


- 0 


y.  e 3<fj5j)<xj) 


J E J, 


where  is  to  be  interpreted  as  f whenever  = 0,  and 

fjCj  is  to  be  interpreted  as  h^  whenever  = 0.  Conversely,  if  (x,y) 
satisfies  the  Kuhn-Tucker  conditions  just  listed  and  if  the  cone  P is  closed, 
then  x solves  (P)  and  y is  a Kuhn-Tucker  vector  for  (P) . 

PROOF.  The  condition  -x  € D4;j^*(y)  in  (4.7)  is  equivalent  (using 
the  bipolar  theorem)  to 


X e cl  K,  y e K , <x,y>  = 0. 


The  special  form  of  K,  cf.  (2.4),  permits  this  to  be  written  as  the 
two  conditions 

(x  ,x^,x‘^)  e cl  P,  (y„,y^,y‘^)  e p*,  <(x  ,x^,x'^),  (y  ,y^,y'^)>  = 0 

o o o o 


= 0,  e R-^,  6 R^,  = 0 . 


Now  consider  the  other  condition  of  (4.7),  y G Df(x).  It  is  an  elementary 
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consequence  of  the  additive  separability  of  f that  this  splits  into  the 
conditions 


(y^.n.)  e 3.;;. (x. ,c.),  V'l  e i, 

(y..n.)  6 30  (x  , c. ),  V j ^ J. 

J J 1 j j 

By  the  subdifferential  formulas  established  in  Propositions  1 and  2,  the  last 
two  sets  of  conditions  can  be  broken  down  further.  Combining  all  the  resulting 
facts  with  the  cone  information  gives  the  characterization  of  the  Kuhn-Tucker 
conditions  asserted.  Next,  the  necessity  assertion  is  immediate  from  the 
implication  (e)  ^ (f)  of  (21,  Theorem  15].  When  P is  closed,  so  is  K, 
and  hence  the  underlying  function  F defined  in  (4.1)  is  closed  convex  in 
u for  each  fixed  x.  Nearly  everything  in  the  converse  assertion  now  follows 
immediately  from  the  implication  (f)  =*  (e)  of  (21,  Theorem  15].  The  only 
remaining  item  to  prove  is  finiteness  of  the  saddle  value.  But  it  is  easy  to 
check,  using  (4.5),  that  if  H has  a saddlepoint  then  its  saddlevalue  must 
be  finite. 

The  following  corollary  corresponds  to  the  classical  Kuhn-Tucker 
theorem.  The  traditional  role  of  some  sort  of  constraint  qualification  is 
played  here  by  the  relation 

inf(P)  = max(D),  (4.12) 

which  is  sometimes  called  "strong  duality."  We  understand  (4.12)  to  mean  that 

val(P)  = val(D)  » -f  (y)  for  some  y € K . (4.12') 

Note  that,  while  the  y's  fulfilling  (4.12')  necessarily  solutions  to 


(D),  they  need  not  in  general  be  Kuhn -Tucker  vectors  for  (P),  cf.  (4.10). 

This  is  because  the  common  optimal  value  in  (4.12')  could  perhaps  be  - <» . 

COROLLARY  4A.  ("Kuhn-Tucker  Theorem")  Assume  that  the  strong 
duality  relation  (4.12)  holds.  In  order  that  x solve  (P),  it  is  necessary 
that  there  exist  a y such  that  (x,y)  satisfies  the  Kuhn-Tucker  conditions 
listed  in  Proposition  4.  When  the  cone  P is  closed,  this  necessary  condition 
is  also  sufficient. 

PROOF.  Let  X solve  (P)  and  y satisfy  the  condition  in  (4.12'). 

it 

Since  f is  never  -»  , val(P)>  - “>;since  f is  never  - <»,  val(D)<+ 

Hence  the  common  optimal  value  in  (4.12')  is  finite,  so  that  y is  in 

fact  a Kuhn-Tucker  vector  for  (P) . Now  apply  the  proposition. 

For  the  necessary  condition  in  Corollary  4A  to  have  substance,  we 

must  provide  conditions  which  guarantee  that  relation  (4.12)  holds.  The 

section  concludes  with  two  such  conditions.  The  first  is  an  all-purpose 

condition,  applicable  to  the  most  general  spaces. 

PROPOSITION  5.  The  strong  duality  relation  (4.12)  holds  if 

there  exist  (x  ,x^,x'^)  £ P and  > 0 such  that 
o 

(i)  f is  bounded  above  on  some  neighborhood  of  x ; 
o o 

(ii)  for  each  i £ I there  exists  > 0 such  that  i 
on  some  neighborhood  of  x^;  and 

(iii)  for  each  j £ J,  o is  bounded  above  on  some  neighborhood 
of  (Xj  , 4;^  ) . 


We  remark  that  when  an  f^  is  continuous,  the  corresponding 

condition  in  (ii)  can  of  course  be  replaced  by  the  condition 

Also,  when  the  topology  on  is  determined  by  a norm,  it  can  be  shown 

that  the  corresponding  condition  in  (iii)  can  be  relaxed  to  simply  requiring 

that  f be  bounded  above  on  some  neighborhood  of  ^ ^ x . 

J J J 


A 


f 

I I 
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PROOF.  By  [21,  Theorem  17],  it  suffices  to  ensure  that  the 
primal  optimal  value  function  'P(u)  = inf{F(x,u)}  is  bounded  above  on 

X 

a neighborhood  of  the  origin.  Since  by  (4.1)  V’  satisfies 
ip(u)  = inf  {f(x)  + ijjj^(x+u)} 

X 

= inf  {f(x-u)  + 

X 

for  any  x,  it  suffices  to  have  an  x G K such  that  the  function  u -*■  f(x-u), 
is  bounded  above  on  some  neighborhood  of  th<.'  origin.  The  rest  of  the  proof 
consist  of  translating  this  condition  into  the  form  asserted,  using  the 
particular  form  of  f and  K given  in  (2.3),  (2.1),  (4.8)  and  (2.4). 

The  second  result  along  these  lines  deals  rather  fully  with  the 
basic  case  in  which  all  the  spaces  are  finite-dimensional.  In  §5,  as 
an  application  of  this  result,  we  shall  obtain  a new,  refined  criterion  for 
the  existence  of  optimal  Lagrange  multipliers  for  ordinary  convex  programming 
in  r". 

The  relative  interior  of  a set  will  be  denoted  by  "ri". 

PROPOSITION  6.  Assume  all  the  spaces  are  finite-dimensional 

and  that  val(P)>  - “.  Then  a Kuhn-Tucker  vector  for  (P)  exists  (and 

hence  the  strong  duality  relation  (4.12)  holds  a fortiori)  if  there 

exists  an  > 0 and  (x  .x^.x"^)  G P such  that  the  conditions 
— o 


(x  ,x^,x'^)  G ri  P,  (4.1>3) 

o 

X G ri  dom  f , (4.14) 

o o 

Xj^  G ri  dom  f^  and  fj,(x^)  < 0,  (4.15i) 

> 0 and  Xj  G C^ri  dom  f ^ , (4.16j) 


i 


i 

I 

! 
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hold  for  all  i € I and  j^j.  Moreover,  when  any  or  all  of  the  problem 
elements  P,  f^,  f^,  f^  are  polyhedral,  corresponding  weakenings  in  conditions 
(4.13)  through  (4.16)  can  be  made  as  follows: 


(x^,x  ,x  ) e p, 


X € dom  f , 
o o 


fi(Xi)  1 0, 


h.  =f.0,C.  >0,  x.S  dom  f.C.  > 

J J 1 - 1 J j 

where  f.?.  is  f.O^  if  = 0.  J 

PROOF.  Since  the  hypotheses  entail  finiteness  of  y = val(P) 
and  since  the  optimal  value  in  (D)  cannot  exceed  y , it  suffices  to 
prove  that 

* * 
y = -f  (y)  for  some  y = K . 


(4.13') 


(4.14') 


(4.15'.) 


(4.16' J 


(4.17) 


We  shall  first  provide  an  argument  leading  to  (4.17),  introducing  hypotheses 

as  needed,  and  afterwards  indicate  why  the  condition  in  the  proposition 

guarantees  the  hypotheses  needed  by  our  argument.  We  begin  by  scanning  all 

the  functions  f , ip . , o.  involved  in  f (given  by  (2.3),  (2.1),  (4.8),  (4.9)) 
o 1 3 

to  see  which  are  polyhedral  and  which  are  not.  Note  that  is  polyhedral 

if  and  only  if  f^  is,  and  (by  Proposition  2)  is  polyhedral  if  and 

only  if  fj  is  and  also  h^  = ® suitable  relabeling  (permutation) 

of  the  coordinates  of  X,  we  can  partition  it  as  X = x X2  in  such 
a way  that  f can  be  written  in  the  form 


f(x)  * f(x^,X2)  - p(x^)  + q(x2). 


(4.18) 
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where  p consists  of  all  the  polyhedral  terms  of  f and  q consists 
of  all  the  nonpolyhedral  terms.  Next,  effectively  extend  the  domains  of 
definition  of  p and  q to  all  of  X by  introducing  functions  p and 
q via 


and 


p(x)  = Hxj^,X2)  = p(Xj^)  ( Vx2) 
q(x)  = q(x^,X2)  = q(x2)  (\/  x^^) 


(4.19) 


(4.20) 


For  definiteness  in  what  follows,  we  suppose  that  P is  polyhedral.  Then 
K is,  too,  and  so  ’J^j^(x)  ® 4)j^(Xj^,X2)  is  also  (even  under  the  relabeling  of 
variables,  since  that  operation  is  accomplished  via  a linear  transformation). 
We  can  now  write 


= inf  {f(x)  + li^j^(x)} 
X 


inf  {p(x^)  + q(x2)  + il'jj(x^  ,X2) } 


(x^,X2) 


= inf  {p(Xj^,X2)  + q(Xj^,X2)  + 4)j^(Xj^ ,X2) } 


(x^,X2) 


-[(p+il^j^)  + q]  (0,0). 


(4.21) 


Since  P "*■  polyhedral,  by  [19,  Theorem  20.1]  we  have 


[(P  + + q]*(0,0)  = (p  + □ q (0,0), 


where  the  infimal  convolution  on  the  right  is  actually  attained,  provided  the 
hypothesis 


(J)  ^ dom(p  + \|>j^)  ri  dom  q 


(4.22) 


is  satisfied.  This  means,  according  to  (4.21),  that 


U - -((p+<[^,^)  (-Wj^,-W2)  + q (Wj^,W2)l 


(4.23) 


J 
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foi  some  w = belonging  to  the  space  Y ® paired  with 


X = X X2*  Since  (4.20)  yields 


q (w)  = q (w^.w^)  = 


q (w^)  if  » 0 


+ “ otherwise 

and  since  p >-«>  , it  follows  from  (4.23)  that 


Wj^  = 0 and  q (w^^.w^)  * q (W2) . 


(4.24) 


By  [19,  p.  179],  we  also  have 


(P  + (0,-W2)  = p a \p^*(0,-W2): 


where  the  inf~convolution  on  the  right  is  attained,  provided  the  hypothesis 


^ ^ dom  p dom  ip 


(4.25) 


is  satisfied.  This  means,  according  to  (4.23)  and  (4.24),  that 


y * -[p  (Zj^,z^)  + (0-Zj^,-W2-Z2)  + q (W2)  ] 


(4.26) 


for  some  z » (Zj^,Z2)  in  Y.  Since  (4.19)  yields 


I P (Zi)  if  ^2  ° ° 
p (z)  = p (Zj^,Z2)  = ^ 

+ «»  otherwise 


and  since  p > , it  follows  from  (4.26)  that 


* 

Z2  = 0 and  p (z^,Z2)  = P (Zj^). 


From  this,  (4.26)  can  be  rewritten  as 


P “ -[P  (Zj^)  + q (w^)  + (-z^,-W2)]. 


(4.27) 


Now  notice  from  the  additive  separability  in  (4.18)  that 


f*(y)  - f*(yj^,y2)  ” p*(yj^)  + q*(y2)« 
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Using  this  together  with  the  fact  that 

•k 

we  can  rewrite  (4.27)  as 

u “ -[f*(y)  + y = (z^^.w^). 

Using  y > - oo  once  more,  we  obtain  y € K for  this  y,  and  so  (4.17)  is 
established.  This  argument  used  (4.22)  and  (4.25).  Since 

dom(p  + \p  ) = dom  p n dom  ip  , 

it  is  clear  that 


dom  p n dom  <|>  H ri  dom  q 
K 


(4.28) 


is  hypothesis  required  for  our  argument.  In  the  event  that  k is  not 
polyhedral,  we  simply  group  the  term  with  q rather  than  p and  then 

mimic  the  above  argument.  The  hypotheses  required  for  this  are 


and 


<(i  dom  p ri  dom(i|/j^+  q) 


({)  ?*  ri  dom  ip^  n ri  dom  q. 


By  [19,  Theorem  6.5],  these  are  equivalent  to  the  single  hypothesis 

0 dom  p ri  dom  (pv  9*  (4.29) 

To  finish  the  proof,  it  remains  only  to  check  that  the  condition 
assumed  in  the  proposition  amounts  to  either  (4.28)  or  (4.29),  according  to 
whether  P is  or  is  not  polyhedral.  Since  K - P><Q,  where  Q is  relatively 
open. 


dom  “ P Q and  rf  dom  4’j^ 


ri  P X Q. 
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Now  consider  the  remaining  sets  involved  in  (4.28)  and  (4.29).  By 
formulas  (4.18)  - (4.20),  together  with  the  additive  separability  of  P 
and  q in  terms  of  the  functions  f^,  i);^,  , the  issue  reduces  to  deter- 

mlnimg  formulas  describing  the  sets  dom  i|^^,  dom  and  also  their 
relative  interiors.  But  (2.1)  yields 


dom 


’^i  “ ^ 1 


and  so  by  [19,  Lemma  7.3  or  Theorem  6.8]  it  follows  that 

ri  dom  = {(Xj.,?^)|x^  e rl  dom  f^,  < O}. 

Also,  according  to  (4.8)  and  (4.9)  we  have 


dom  a.  = {(x.,£.)|^.  > 0 and  x.  £ dom  f . C • » C . = 0 and  x . ^ dom  h . } , 

j j j j j j j 3 3 3 


so  by  [19,  Theorem  6.8]  it  follows  that 

ri  dom  = {(Xj,?^)!?^  > 0,  Xj  e dom  f ^ . 

From  this  information  it  is  easy  to  see  that  the  condition  in  the  proposition 
is  equivalent  to  either  (4.28)  or  (4.29),  depending  on  whether  P is  or 
is  not  polyhedral.  This  concludes  the  proof. 

Alternative  conditions  which  ensure  (4.12)  can  be  developed  by 
combining  [21,  Theorem  17]  with  the  various  criteria  given  in  [21,  Theorem  18], 
much  as  in  the  proof  of  Proposition  5.  This  we  leave  to  the  Interested  reader. 

Finally,  we  remark  that  there  are  other  general  results  which  hold 
for  the  present  model  by  virtue  of  its  fitting  the  general  perturbational 
duality  framework.  We  refer  the  reader  to  [21,  espceially  §7],  from  which 
it  is  possible  to  deduce  these  further  results  concerning  the  symmetrized  model. 
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5.  Specialization  to  ordinary  progranuning:  comparison  with 
the  traditional  LaRrangian  model.  We  now  analyze  the  symmetrized  model 
of  §4  In  the  most  important  case, 

J = 0 , P = {(x^,x^)|x^  = x^,\/i  e i},  p » {(yQ,y^)lyQ  I “ O) , (5.1) 

to  see  how  it  compares  with  the  usual  duality  model  for  the  ordinary 
convex  programming  problem 

(P  ) min{f  (x  )lf.  (x  ) < O.Vi^  I }. 

O O O 1 O — 


We  shall  see  that  the  resulting  symmetrized  model  is  technically  distinct 
from  ordinary  duality,  yet  closely  related  and  in  fact,  under  generally  mild 
conditions,  essentially  equivalent.  The  symmetrized  model  thus  provides  an 
alternate  means  of  obtaining  facts  concerning  the  ordinary  convex  program. 

In  particular,  we  establish  in  this  way  a new,  sharpened  existence  theorem 
for  optimal  Lagrange  multipliers  in  the  finite-dimensional  case.  We  also 
derive  (directly)  under  minimal  assumptions  the  classical  Kuhn-Tucker 
optimality  conditions  for  the  case  of  nondif ferentiable,  not  everywhere 
finite  functions  in  general  spaces.  As  an  illustration  of  the  delicate 
interrelationships  between  the  ordinary  and  the  symmetrized  models,  we 
conclude  with  a discussion  of  the  case  of  homogeneous  constraint  functions. 
This  extends  recent  work  of  C.  R.  Glassey  concerning  "explicit  duality" 
for  such  problems. 

By  the  "ordinary  duality  model"  we  mean  the  trio  of  problems 
consisting  of  (P^)  together  with  its  so-called  Lagranglan  dual  problem 


max 

>0 


{ inf  (x  ) + 1 

__  oo  tllo 

X e c I 


I 


and  Lagranglan  saddlepolnc  problem 


(L  ) mlnlmax  {f  (x  ) + T n . f ^ (x  ) } 

° X ec  T „ ° ° ^ i 1 o 

o n i 0 

We  use  the  notation 


C « C n f ' c.  , C - dom  f , C - dom  f . V/l  ^ I 
oli  o o i 1“ 


(5.2) 


and  make  the  trivial  nondegeneracy  assumption  that  C 0.  In  the  case 

of  functions  not  everywhere  finite,  It  Is  essential  to  observe  the  restriction 

X € C In  both  (D  ) and  (L  ) . 
o o o 

This  trio  arises  from  the  general  perturbatlonal  duality  model 
by  choosing 


o o 


f (x  ) if  f (x  ) < y \/i  e I 
o o 1 o — 1 

+ “>  otherwise 


(5.3) 


Indeed,  min  {F  (x  ,0)}  Is  exactly  (P  ),  and  when  we  generate  G and  H 

3^  O O O O O 

from  F by  means  of  the  formulas 
o ^ 

Ho(Xo,n^)  - Inf  {F  (x^,y^)  + 


G (n^.v  ) - :^f  {H  (x  ,n^)  - <x  ,v  >} 

O O Xq  o o o o 

prescribed  In  [21,  see  eqs.  (4.2),  (4.15)],  we  easily  obtain  by  (5.3)  that 


H (x^,n^)  - ■ 
o o 

1 ^0^*0^  1 

00 

1 

1 

if 

if 

X € c 
o 

X € C 
o 

and  ^ 0 

and  ri^  0 

(5.4) 

[ + a> 

if 

Xo^C 
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(n^.v  ) = f inf  {f  (x  ) + I n.f. (x  ) - <x  ,v  >}  if  > 0 (5.5) 

O J..  CO  oo  siio  oo  — 


X 6C 

o 


otherwise 


From  these,  it  is  clear  that  the  problem  max  {G  (n^,0)} 

I ° 

n ‘■ 

coincides  with  (D  ) and  the  problem  minimax  {H(x  ,n^)}  coincides  with  (L  ). 

O Xf,  T o o 

n 

Now  consider  the  form  taken  by  the  symmetized  model  of  § 4 under 
the  particular  choice  (5.1).  The  primal  problem  (P)  becomes 

(P  ) min  {f  (x  ) I (x  ,x^,C^)  such  that  f . (x  ) < 0 \/  1 ^ I« 

S OO  O lO"* 

and  C^  = 0,  X =x.  \/i6i}. 

o 1 

The  extra  variables  (x^,5^)  here  are  of  course  completely  determined  and 
could  be  suppressed  without  harm.  Nevertheless,  for  the  purpose  of  main- 
taining a clear  distinction  between  the  ordinary  and  the  symmetrized  model, 
it  is  useful  to  keep  in  mind,  at  least  in  a formal  way,  the  presence  of 
these  additional  variables.  With  the  dual  and  Lagrangian  problems  (D^) 
and  (L^)>  analogous  distinctions  actually  can  make  a difference  in  terms 
of  solvability. 

The  dual  problem  (D)  when  specialized  according  to  (5.1)  becomes 

(Dg)  - min  + I (f  (y^)  10,  Yq  + I Yi  = 0}  , 

* * + 

where  is  f^O  whenever  = 0. 

The  result  to  follow  shows  that,  under  a usually  harmless  assumption,  we 

can  substitute  for  (D^ ) a simpler  dual  problem  not  requiring  the  recession 
* + 

functions  f^  0 : 
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(Dg)  - min  {f*  (y^)  + ^ (y^)  In^  >0,  y^,  + J y^  - o}, 

A 

where  is  '^'{q}  whenever  “ 0. 

The  result  also  provides  relatively  mild  conditions  under  which  solving 
(D^),  or  even  the  simplified  form  (D^)»  is  actually  equivalent  to  solving 
the  ordinary  dual  (D^) . 

PROPOSITION  7.  (a)  Assume  that  the  effective  domains  of  the 

functions  satisfy 


C C C 


..  V 


1 e I 


(5.6) 


(see  (5.2)).  Then  the  optimal  values  in  (D  ) and  (D  ) coincide.  Moreover, 

8 S 

any  solution  of  (D  ) is  a solution  of  (P^)  and  conversely,  if  (y  ,y^,n^) 

S 8 O 

solves  (D^)  then  (y^»y^,n^)  solves  (Dg)»  where 


y “ y y yj  and  y. 

■' o ■’ o ^ •'i  ■'i 


0,  i G I 


yf.  i e i^ 


(5.7) 


for  I = {1  G ijn  = 0}  , r = I \i  . 
o i + o 

(b^)  Assume  any  hypothesis  which  will  ensure  that,  for  each  ^ 0, 

-(fo  + I Vi>*^^o>  “ <^0^  ^ >’i  = '^o^  ‘ 1 

I II-  (5.8  ) 

A + 

where  is  and  f^  is  f^  0 whenever  = 0.^ 

Then  the  optimal  values  of  (D^)  and  (D^)  coincide.  Moreover,  if  (yQ»y^,n^) 

solves  (D^)  then  solves  (D  ),  and  conversely,  if  solves  (’^  ) then 

so  o 

there  exist  y and  y^  such  that  (y  ,y^,Tl^)  solves  (D^) 
o Os 

(b)  Assume  any  hypothesis  which  will  ensure  that,  for  each  ^ 0> 


L 
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L 


+ I n^f.)  (V^)  = min  (y^)  + J (f^  ^ I y±  ^ ^o'^' 

1+  1+ 

where  1+  = (i  ^ l|n^  > 0}  , 


(5.8) 


and  assume  that  (5.6)  holds  as  well.  Then  the  optimal  values  of  (D  ) and 

s 

(D  ) coincide.  Moreover,  if  (y  ,y^,n^)  solves  (D  ) then  solves 

o os 

(D  ),  and  conversely,  if  ri^  solves  (D  ) then  there  exist  y and  y^  such 


that  (y^.y^,!!^)  solves  (0^)- 


PROOF.  Observe  first  that  any  collection  proper  convex 

functions  having  lower  semicontinuous  hulls  somewhere  finite  satisfies 


inf{)^  ^.}  ^ inf  {cl<^. } 

Xo  ’ Xo 


= -(IM  (0) 


= - cl(«^^*D...Dvj/)(0) 


(5.9) 


> -(^-  □...CV  *)(0)  . 

— i m 


The  second  equality  here  follows  by  the  same  argument  as  given  in  [19, 


bottom  p.  145].  Using  (5.9)  and  then  'P^q}  ^ 0 » we  have 

inf{f  + I n.r}  = inf{f  + J + I n^f.} 

C ° Xn  l„^i  ^ 


(5.10) 


> -inf{f^  (y^)  + I (fi*ni)(yi) ly^  + I y^  = 0} 

1+  1+ 


for  each  n ^0.  This  establishes  the  general  relations 


val  (D  ) < val  (D  ) < val  (D  ) , 
s — 8 — o 


(5.11) 


*«+  . - *„+ 


Now  assume  condition  (5.6).  Then  4*,.  < 4^^  . and  hence  f . 0 > f 0 by 

II  ^ ° I 

Lemma  1 in  §3.  If  (yQ*y  »U  ) satisfies  U ^0,  y^,  J “ 0 ^ad 
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y G dom  f , then  we  can  write 
o o 


C<^>  + ^ (fjVxyi)  1 C<>'o>  * J 


> f^*(y^)  + y^) 

o 

= C^yo>  ■'J  '‘'{o}<yi>  • 


(5.12) 


* + 

The  second  inequality  is  by  the  subadditivity  of  0 (Lemma  1 shows  it  is 

* 

a support  function) , while  the  third  inequality  follows  from  y^  G dom  f ^ 

* + 

and  the  definition  of  recession  function  (see  ff.(2.2)).  Since  ^ 0 » 

equality  must  hold  throughout  (5.12).  Hence 


^o  ^ 4 

where  ^ ^ 4 ~ ^ ^i^  I “ 0*  This  shows  that  the  optimal 

T To  T + 

values  in  (D  ) and  (D  ) coincide  and  that  if  (y  ,y  ,r|  ) solves  (D  ) then 
s s o s 

the  (y^,y^,n^)  induced  as  in  (5.7)  solves  (D^)*  On  the  other  hand,  suppose 

that  (y^.y^jH^)  solves  (D^) . If  the  (common)  optimal  value  is  - then 

trivially  (y  ,y^,n^)  also  solves  (D^)  . So  suppose  val(D  )>-<».  Then 
o s s 

•k  4- 

necessarily  y^  = 0 for  each  i G so  that  = 0 = (f^  0 ) (y^) 

for  such  i's.  Hence  (y^jy^jH^)  yields  the  same  value  in  the  objective  function 
of  Tt  does  in  (D^) . Since  the  two  optimal  values  agree,  this  vector 

must  also  be  optimal  for  (0^) . Now  assume  condition  (5.8^).  This  yields 
equality, with  attainment  by  some  y's,  in  the  first  inequality  of  calculation 
(5.10).  The  conclusions  of  part  (b^)  are  immediate  from  this.  Finally, 
assume  conditions  (5.8)  and  (5.6).  These  yield 
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inf{f  + I n^f.}  = inf{f  + I >Pr  + I n.f.) 

C I ^ ^ Xo  ° lo  "'i  4 

“ inf{f  + y n. f.} 
o ' i 1 


= - minify  (y^)  + y (f^  n^)(yj^)|y^  + I = Oi 

4 1+ 

for  each  2.  conclusions  of  part  (b)  are  immediate  from  this. 

Condition  (5.6)  can  always  be  met  by  simply  redefining  (if 

necessary)  to  be  + “>  outside  C.  The  new  f^  will  then  satisfy 

» C C C^,  \/  i G I,  The  only  adverse  effect  likely  from  this  redefinition 

A 

is  the  possible  complication  of  the  formula  for  f^  . This  can  occur,  as 
for  example  in  the  case  of  programming  subject  to  homogeneous  constraints 
discussed  at  the  end  of  this  section. 

Conditions  (5.8)  and  (5.8^)  are  somewhat  more  restrictive  hypotheses 
but  still  relatively  harmless.  Indeed,  generally  speaking,  they  are  easier 
to  satisfy  (i.e.  weaker)  than  a constraint  qualification.  For  example,  ac- 
cording to  [21,  Theorem  20(a) j,  (5.8)  is  satisfied  in  general  spaces  if  there 

exists  an  x such  that  each  of  the  functions  f , f .(i  G I)  is  finite 
O O 1 

at  x^  and  all  except  possibly  one  of  them  are  bounded  above  on  some 
neighborhood  of  x^.  Alternatively,  if  = R^,  (5.8)  is  satisfied  whenever 


0 ri  C n (Cl  ri  C., 
o 1 1 


where  here  the  relative  interior  operation  can  be  deleted  for  any  and  all  of 
those  functions  f^,  f^  (i  € I)  which  happen  to  be  polyhedral  [19,  Theorems  16.4, 
20.1].  Other  conditions  sufficient  to  guarantee  validity  of  (5.8)  and  (5.8^) 
in  both  finite- and  inf inite-dimensional  contexts  can  be  formulated  using 
[21,  Theorem  20],  [6,  §§4e  and  9c]  and  [22,  Theorem  5.6.2].  For  a somewhat 
tangential  yet  related  discussion  of  the  consequences  of  (5.8),  see  [5]. 
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We  remark  that  in  the  presence  of  (5.6),  it  can  be  proved  that 

condition  (5.8)  is  equivalent  to  condition  (5.8^). 

From  Proposition  7(a)  it  follows  that,  when  (5.6)  can  be  arranged 

* 

(without  adverse  effects  from  complicating  f ),  we  might  as  well  deal 

o 

with  the  simplified  symmetrized  dual  (D  ) rather  than  the  technically 

s 

correct  version  • It  should  be  noted,  though,  that  in  so  substituting 

(D  ) for  (D  ) , we  will  in  general  be  discarding  certain  unbounded  portions 
s ® 

of  the  solution  set  of  (D  ).  But  this  is  immaterial  in  terms  of  the 

s 

actual  solvability  of  the  dual  problem,  so  long  as  (5.6)  holds  (cf.  (5.7)). 

In  connection  with  Proposition  7(a),  we  remark  that  even  without 
condition  (5.6)  an  asmptotlc  relationship  holds  between  (Dg)  and  (0^)* 
Namely,  by  using  [17,  Theorem  3b(e)]  one  can  show  that  to  each  feasible 
solution  (yQ,y^,Ti^)  of  (D^)  there  corresponds  a net  of  vectors. 

Indexed  by  a,  satisfying 


>0,  = li® 

a a a 


* I 


and 


If  (D  ) were  known  to  be  normal  (i.e.  if  the  perturbation  function 
s 

corresponding  to  (D^)  satisfied  (use  y)(0)  ^ y(0))»  it  would  follow  that 

val(D  ) » val(D^) 
s s 

and  hence  that  each  solution  of  (D  ) is  a solution  of  (D ^ ) . 

s s 

We  turn  our  attention  now  towards  a comparison  of  Kuhn-Tucker 
theories  associated  with  the  synimetrlzed  and  the  ordinary  models.  For 
the  ordinary  model  this  involves  the  traditional  Lagranglan  function 
given  in  (5.4),  while  for  the  symmetrized  model  it  involves  specializing 
the  function  H given  by  (4.5)  according  to  (5.1).  This  yields  H^,  defined 


{ 


] 

I 

i 

i 
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as  follows: 


H (x  ^ ) - <(x  ,x^,C^),(y  ,y^,n^)> 

£>  O w V w O w 


(5.13) 


if  the  "x-conditions" 


X € dom  f , f .(x. ) + ^.  <0\/i€I 
o o i i i — . 


and  the  "y-conditions" 


e R^,  y + j;  y = 0 
I 


are  both  satisfied;  H (x,y)  = - “ if  these  x-conditions  are  met  but  the 

s 

y-conditions  fall;  and  H^(x,y)  = + “>  if  the  x-conditions  fail.  The 

Lagrangian  function  plays  the  same  role  for  (Pg)  and  (D^)  (and  also 

(D  ) when  (5.6)  holds)  as  H plays  for  (P  ) and  (D  ) . In  particular,  the 
s o o o 

pairs  (x^,x^, ^^) , (y^,y^,n^)  characterized  in  Proposition  9 below  as  the 
solutions  to  the  "symmetrized"  Kuhn-Tucker  conditions  are  precisely  the 


saddlepolnts  of  the  minimax  problem  (Lg)  determined  by  H^. 

The  Kuhn-Tucker  vectors  for  (P^)  are,  according  to  (5.1)  and 
the  general  definition  in  §4,  those  (y^.y^.H^)  which  satisfy 


i’’  > 0.  y^  + J = 0 


val(Pg)  =*  -[f^  (y^)  + I (f^  0'‘')(y^)  + J (f^  n^)(yj^)]  ^ R, 

lo  1+ 

where  as  usual  we  write  I = {i  € l|n,  = 0}  and  = I \ I . In 

o i + o 

particular,  they  are  solutions  of  (D^)  . Also  of  interest  are  the  vectors 

s 

which  satisfy  the  stronger  conditions 


(5.143“^) 


(5.14b‘^) 


n i 0.  y^  + 


I y^  - 0.  y^  - 0 V 


i e I 


(5.14a) 
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vai(P  ) « -[f  *(y„)  + I (f/n.)(yJl  e R. 

S O O 1 1 1 


(5.14b) 


Vectors  (y^.y^jH^)  satisfying  the  latter  pair  of  conditions  will  be 


called  strong  Kuhn-Tucker  vectors  for  (Pg)»  They  are  contained  in  the 


solution  set  of  the  simplified  dual  (Dg)-  The  Kuhn-Tucker  vectors  for 


(P^)  are  the  n which  satisfy 


> 0,  val(P  ) = inf  {f  + 7 ^ R* 

o o i i 


(5.15) 


They  are  also  called  optimal  Lagrange  multipliers  for  (P^)*  The  next 


result  describes  the  interrelationships  among  these  objects. 


PROPOSITION  8.  Let  > 0 be  given,  and  write  I^  - {i  e i | > 0} 


and  I^  ■ I The  implications 


(b)  ■*  (b")  ^ (a) 


hold  among  the  conditions  below.  Furthermore,  if  (5.8^)  holds  then  (b^) 


is  equivalent  to  (a),  and  if  (5.6)  holds  then  (b)  is  equivalent  to  (b  ). 


If  both  (5.8)  and  (5.6)  hold,  the  three  conditions  are  mutually  equivalent. 


(a)  n is  a Kuhn-Tucker  vector  for  (P^) » l.e.  (5.15)  holds; 


(b^)  there  exist  y^  and  y^  such  that  (yQ,y^,n^)  is  a 


Kuhn-Tucker  vector  for  (P  ),  i.e.  (5.14  ) holds. 

s 


(b)  there  exist  y^  and  y^  such  that  (yp»y^,n^)  is  a strong 


Kuhn-Tucker  vector  for  (P  ),  l.e.  (5.14)  holds. 

s 


PROOF.  Note  first  that  to  go  along  with  the  general  relation 


(5.11)  established  above  we  also  have 


val(D  ) < val(P  ) ■ val(P  ). 
0—0  s 


(5.16) 


The  implication  (b)  ♦ (b^)  is  trivial  from  the  fact  that  f^  0^(0)  = 0. 
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Suppose  (y^»y^r|^)  satisfies  (5.14^).  By  the  estimate  (5.10)  together 
with  the  general  inequalities  (5.11)  and  (5.16),  it  follows  that 
satisfies  (5.15).  Hence  (b^)  =*  (a).  Next,  suppose  satisfies  (5.15) 

and  that  condition  (5.8^)  holds.  Then  by  Proposition  7(b^),  there  exist 
y^  and  y^  such  that  (5.1A^)  holds.  Hence,  (a)  ^ (b^)  in  the  presence 
of  (5.8^).  Now  suppose  (y^»y^in^)  satisfies  (5.14^)  and  that  condition  | 

(5*6)  holds.  Define  y^  and  y^  as  in  (5.7).  By  Proposition  7(a)  it 
follows  that  (y^.y^.n^)  satisfies  (5.14).  Hence,  (b^)  ••  (b)  in  the 
presence  of  (5.6).  The  final  assertion  follows  by  combining  what  has  already 
been  proved,  using  the  fact  (remarked  earlier)  that  (5.8^)  is  equivalent  to 
(5.8)  in  the  presence  of  (5.6).  Alternatively,  apply  Proposition  7(b). 

This  completes  the  proof. 

Propositions  3 and  4 and  Corollary  4A  from  §4  can  all  be  specialized 

according  to  (5.1),  of  course,  to  yield  comparable  assertions  concerning  the 

trio  (P  ),  (D^)  , (L  ) . We  will  not  write  all  of  this  down,  though.  Instead, 
s s s 

we  focus  on  just  that  part  of  Proposition  4 characterizing  the  Kuhn-Tucker 
conditions,  to  see  what  they  look  like  for  (P^)* 

PROPOSITION  9.  A pair  of  vectors  (x^,x^,?^)  and  (y^ty^.H^) 
satisfies  the  Kuhn-Tucker  conditions  for  (P^)  if  and  only  if  it  satisfies 
the  conditions 


where  I 

o 


x^  - x^V  1 € I,  = 0,  Xq  + I 
^o  ^ 3^0 ^^o^’ 

f . (x  ) < 0 < n.  and  f (x  ) 'n,  “ 0 ^/i  S l, 

lol 

^i  ^ ^’^dom  f^^*o^  V i ^ ^1  ^ V i e 1^  , 

{i  e l|n.  • 0}  and  I . ■ I Xl  . When  (5.6)  holds,  it  is 
' 1 + o 


(5.17a) 

(5.17b) 

(5.17c) 

(5.l7d'*‘) 

i 

I 

( 
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possible  to  satisfy  the  preceeding  conditions  if  and  only  if  it  is  possible 
to  satisfy  the  simplified  conditions  obtained  by  replacing  (5.17d^)  with 

y^  = 0 ''/ i € and  y^  € n^3f^(x^)  V 1 ^ I_j,*  (5.17d) 

PROOF.  The  first  assertion  is  immediate  from  Proposition  4, 
in  view  of  (5.1).  Now  observe  that 

whenever  > 0 and  3f^(x^)  i 0 , 

and  that 

0 € 3iJ;_  (x  ) whenever  x G C.  . 

o o i 

From  these  facts  it  is  clear  that  any  pair  (y^.y^,!!^)  which 

satisfies  v5.17)  must  also  satisfy  (5.17^).  Now  suppose  we  have  such  a 
pair  satisfying  (5.17^),  and  assume  that  condition  (5.6)  holds.  Define 
(y^,y^,n^)  as  in  (5.7).  It  is  clear  that  the  given  (x^,x^,^^)  together 
with  (y^ty^.n^)  satisfy  everything  in  (5.17)  except  possibly  requirement 

(5.17d).  But  we  have  x G C C C . , from  which 

0 o 1 

DU;  (x^)  C DU;^  (x^) 

1 o 

follows  easily.  We  also  have  the  easy  fact  that 

Df  (x  ) +DU;  (x  ) C Df  (x  ) . 

O O Co  o o 

o 

Combining  this  information  yields 

^o  = ^o  } H 

e Df„(x^)  + I 

C Df  (x  ) + DU/p  (X„) 
o o * o 

lo  O 


c 3f.(x  ). 
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and  so  the  proof  Is  complete. 

The  extremallty  conditions  just  derived  for  the  symmetrized  model, 

at  least  in  the  simplified  form  (5.17),  are  extremely  closely  related  to  the 

well  known,  classical  Kuhn-Tucker  conditions.  In  order  to  make  a precise 

comparison  between  the  two  in  the  present  context,  we  now  derive  the  classical 

Kuhn-Tucker  conditions.  These  are  obtained  as  part  (b)  of  the  following  result, 

which  extends  [19,  Theorem  28.3  (see  1972  edition}]  to  the  general  case. 

PROPOSITION  10.  (a)  In  order  that  x solve  (P  ) and  be 

o o 

a Kuhn-Tucker  vector  for  (P  ),  it  is  necessary  and  sufficient  that  (x  ,r)^) 

o o 

solve  (L  ) . 
o 

(b)  In  order  that  (x  ,ti^)  solve  (L  ) it  is  sufficient  that  (x  ,ri^) 

o o o 

satisfies  the  conditions 

f . (x  ) < 0 < n.  and  f . (x  ) ‘n . “ 0 1 G I (5.18a) 

lo—  — i ioi 

and 

0 e 3f  (x^)  + I Ti.3f.(x  ),  (5.18b) 

' O O ^ 1 1 o 

where  I^  = {i  G lln^  > 0}  . When  (5.6)  and  (5.8)  hold,  these  conditions 
are  also  necessary. 

PROOF.  Most  of  part  (a)  follows  immediately  from  the  equivalence 

between  (e)  and  (f)  in  [21,  Theorem  15],  since  the  function  F defined 

o 

in  (5.3),  which  underlies  the  ordinary  duality  model,  is  clearly  closed 
convex  in  for  each  fixed  x^.  The  flnlteness  of  the  common  optimal 

value  for  the  necessity  half  of  (a)  is  built  in  to  the  definition  of 
Kuhn-Tucker  vector.  Finiteness  in  the  sufficiency  half  follows  from  the  fact 
that  if  (x^,n^)  is  a saddlepoint  of  then  x^  G c and  > 0,  in 

which  case  the  saddlevalue  is  finite.  This  is  easy  to  deduce  directly  from 
(5.4).  (It  requires  our  nondegeneracy  assumption  C ^ 0.)  Now  let  us 
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establish  (b).  As  just  noted,  if  (*Q»n  ) is  a saddlepoint  of 


then  necessarily  € c and  n ^0.  From  this  it  follows,  using  (5.4), 


that  the  saddlepoint  condition  is  equivalent  to  the  conditions  (x^  G C, 


n >0  and) 


I (fi^-nj^)*f^(x^)  < 0 , V 


and 


Now  it  is  easily  seen  (in  the  presence  of  x £ C,  r|  ^ 0)  that  the 


first  of  these  is  equivalent  to  (5.18a)  and  that  the  second  is  equivalent  to 


0 e 3(fo  + I n^f^Xx^), 


where  is  interpreted  as  whenever  = 0,  or  in  other  words,  to 


the  condition 


oe8(t^  + It  +In^£i)Uo) 

Id  ^ ^+ 


(5.18b  ) 


where  = {i  e l|n^  = O}  and  The  remaining  analysis  concerns 

* X 

breaking  (5.18b  ) down  further.  Suppose  ) satisfies  (5  together 

with  (5.18b).  Then 


0 ^ J {0>  + I ^i^^i^%^ 


C 3F^(x^)  +l  + I n^3fi(x^) 

lo  i 

c 3(£__  t I 3*  t J njfj)(K„) 


h 1 4 


shows  that  (5.18b  ) holds.  The  first  inclusion  here  follows  trivially  from 


the  fact  0 € d\l>  (x  ) , while  the  second  is  an  elementary  fact  concerning  the 
Ci  o 


J 


f 
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subdifferential  of  a sum  (see,  e.g.  [19,  proof  of  Theorem  23.8)).  This 

establishes  the  sufficiency  part  of  assertion  (b).  Now  assume  conversely 

I ^ 

that  (x^«n  ) satisfies  (3.18a)  together  with  (5.18b  ) and  that  conditions 

(5.6)  and  (5.8)  hold.  By  (5.6)  we  have  that 


fo  ^ '^C  ^ "^i^i  “ ^o  ^ '^i^i 


^o  ^ 


and  hence  (5.18b  ) simplifies  to 


0€  3(£„f  I . 


From  (5.8)  it  follows  by  an  elementary  argument  that 


(Such  an  argument  can  be  found  in  [5,  end  of  §2].)  Combining  the  last  two 
facts,  we  obtain  (5.18b),  thus  completing  the  proof. 

It  is  clear  that  the  ordinary  Kuhn-Tucker  conditions  (5.18)  are 
satisflable  if  and  only  if  the  simplified  conditions  (5.17)  corresponding 

4- 

to  the  symmetrized  model  are  satisflable.  Similarly,  conditions  (5.17  ) 
are  satisflable  if  and  only  if  conditions  (5.18^)  are,  where  by  (5.18^)  we 
mean  (5.18a)  together  with 


Df  (x  ) + 
o o 


I 


r 

o 


+ I n.3f . (x  ) . 
-j.  i 1 o 


(5.18b'^) 


COROLLARY  lOA.  ("Kuhn-Tucker  Theorem")  If  conditions  (5.6), 

(5.8)  and  the  strong  duality  relation  inf(P  ) = max(D  ) hold,  then  in  order 

o o 

that  X solve  (P  ) it  is  necessary  that  there  exists  an  which  together 

o o 

with  x^  satisfies  the  Kuhn-Tucker  conditions  (5.18),  Conversely. 

if  (x  ,n^)  satisfies  (5.18),  then  x solves  (P  ) and  is  a Kuhn-Tucker 

o o o 

vector  for  (P  ). 


5i 


I 

I 


T 


? 


PROOF.  The  converse  assertion  is  immediate  from  the  proposition. 

Suppose  now  that  inf(P  ) * max(D  ) =*  JJ.  From  (5.5)  it  is  clear  that  no 

o o 

can  yield  value  + “>  in  (D  ) , so  since  (D  ) has  a solution  we  must  have 

o o 

y < +“.  (This  uses  our  nondegeneracy  assumption  C 0;  see  (5.5).)  Suppose 

(P  ) has  a solution.  Then  y > - «>  , because  (5.3)  shows  that  no  x 
o o 

can  yield  value  - in  (P^) • Hence  y must  be  finite,  and  the  hypothesized 
solution  to  (D^)  is  actually  a Kuhn-Tucker  vector  for  (P^) ♦ Now  apply  the 
proposition. 


Notice  that  Proposition  10  and  Corollary  lOA  are  the  analogues,  for  the 
ordinary  duality  model,  of  Proposition  4 and  Corollary  4A.  For  completeness, 

we  provide  the  analogue  of  Proposition  3.  This  extends  [19,  Theorem  28.1] 
to  the  general  case. 

PROPOSITION  11.  Let  0^  be  a Kuhn-Tucker  vector  for  (P  ).  Then 

o 

the  optimal  solutions  to  (Pp)t  if  any,  occur  among  the  global  minlmizers 
of  the  function 


x 

o 


^ H^(x^,n^)  = 

o o 


f I if  ^ ^ 

o o ^ 1 1 o o 

^ “ otherwise  . 


In  particular,  x solves  (P  ) if  and  only  if 
o o 

X solves  inf  {f  + I H.f.  ) 
o ^ o ^ i i 

and  also  satisfies 


(5.19) 


f, (x  ) < 0 < n,  and  f (x  ) ‘H . 

lO  — — X lOl 


0,  \/  ±e  I. 


(5.20) 


PROOF.  We  can  obtain  this  result  very  succinctly  as  a corollary 

to  Proposition  10,  as  follows.  Since  r)^  i®  ® Kuhn-Tucker  vector,  the  proof 

of  Proposition  10  shows  that  x^  solves  (P^)  if  and  only  if  (x^,n^) 

* 

satisfies  (5.18a)  and  (5.18b  ).  But  these  conditions  are  the  same  as  (5.20) 


\ 


t 


and  (5. 19) respectively . An  elementary,  direct  proof  can  also  be  given. 

This  we  leave  to  the  reader. 

Next,  we  discuss  constraint  qualifications.  This  is  the  term 

usually  given  to  any  of  a variety  of  conditions  which  guarantee  the 

existence  of  a Kuhn-Tucker  vector,  or  at  least  guarantee  the  strong  duality 

relation  (4.12)  for  the  model  under  study.  Propositions  8 and  7 imply 

that  any  result  along  these  lines  for  the  symmetrized  model  (involving  either 

(D^^)  or  (Dg))  yields  the  corresponding  result  for  the  ordinary  model. 

Sharp  conditions  under  which  the  converse  implications  hold  are  provided 

also  in  Propositions  8 and  7.  Notice  in  particular  that  when  conditions 

(5.6)  and  (5.8)  both  hold,  it  makes  no  difference  for  which  of  the  (three) 

models  one  establishes  such  results.  That  is,  under  these  conditions 

it  is  imnaterial  whether  one  derives  the  result  for  (P  ) and  (D  ) , for 

o o 

(P  ) and  CJ^)  , or  for  (P  ) and  (D  ) . 
s ' s s s 

This  raises  the  question  of  whether  one  can  generate  weaker 

(hence  better)  constraint  qualifications  by  working  with  the  symmetrized 

model.  In  general  the  answer  is  no,  and  the  reason  is  as  follows. 

Constraint  qualifications  are  intimately  tied  up  with  the  optimal  value 

function's  being  bounded  above  on  some  neighborhood  of  the  origin  in  U, 

the  space  of  perturbations  (see  [21,  &.  ').  Now  the  U involved  in  the 

symmetrized  model  is  X x X ^ x R^,  since  the  model  includes  horizontal 

o o 

translations  of  the  functions  f , f . (i  G I)  as  well  as  vertical  translations 

o i 

of  the  the  other  hand,  U for  the  ordinary  model  is  simply 

R^,  since  only  the  vertical  translations  are  involved.  So  if  X^  is 

infinite-dimensional,  it  is  generally  more  difficult  to  ensure  the  boundednes 

property  for  tp  , the  optimal  value  function  for  (P,),  than  it  is  for 
s & 

'P  , the  optimal  value  function  for  (P  ). 
o o 
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To  Illustrate,  consider  the  form  taken  for  (P^)  by  the  all- 
purpose constraint  qualification  presented  in  Proposition  5.  It  is  that 
there  should  exist  an  x satisfying  both  (1)  f is  bounded  above  on 

O J ^ Q 

some  neighborhood  of  x^,  and  (2)  for  each  i € i there  exists  an 

> 0 such  that  ~ ~ some  neighborhood  of  x^.  This  qualification 

is  essentially  based  on  applying  [21,  Theorem  18(a)]  to  F^,  as  an  in- 
spection of  the  proof  of  Proposition  5 quickly  reveals.  It  guarantees 
an  optimal  (yQ,y^,r|^)  for  (D^)  , where  the  component  is  optimal  for 

(D^)  by  Proposition  7 (see  the  estimate  (5.10)).  By  contrast,  applying 
the  same  tool  [21,  Theorem  18(a)]  to  yields  a considerably  weaker  (hence 

better)  constraint  qualification,  namely,  that  there  should  exist  an  x^ 

in  dom  f such  that  f . (x  ) <0  for  every  i € I (the  so-called  Slater 
o i o 

condition).  The  latter  condition,  though  weaker,  does  however  have  the 

countervailing  aspect  of  guaranteeing  only  an  optimal  for  (D^) ■ 

Lacking  further  assumptions,  it  is  not  enough  to  ensure  in  general  the 

existence  of  an  "optimal"  pair  y ,y^  which,  together  with  n^,  will 

o 

solve  (Dg)  » condition  (5.8)  or  (5.8^)  would  typically  be  required  for 
that  (see  Proposition  7).  In  situations  where  not  both  (5.6)  and  (5.8) 
hold,  there  is  then  a certain  tradeoff:  constraint  qualifications  for  the 
ordinary  model  are  generally  weaker,  but  the  conclusions  implied  are  not 
as  strong  as  for  the  symmetrized  model. 

In  the  case  = r",  it  turns  out  that  the  general  difficulties 
mentioned  above  do  not  apply.  This  is  due  in  part  to  the  availability  of 
the  considerable  arsenal  of  special  facts  concerning  relative  interiors. 
Combined  with  the  additive  separability  structure  which  is  the  distinguishing 
feature  of  the  symmetrized  model,  this  permits  the  following  particularly 
refined  existence  result.  It  extends  slightly  Rockafellar's  theorem 


r 
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[19,  Theorem  28.2],  which  already  refined  the  Slater  condition  to  handle 
affine  functions. 

PROPOSITION  12.  Assume  X = and  val  (P  ) > - “>.  In  order 

o o 

that  there  exist  a Kuhn-Tucker  vector  for  (P  ) (and  hence  a fortiori  a 

s 

Kuhn-Tucker  vector  for  (P  ))  it  is  sufficient  that  there  exists  an  x 

o o 

satisfying  the  conditions 


X € ri  dom  f 
o o 


(5.21) 


and 


X 6 ri  dom  f . , f . (x  ) < 0 
o 1 1 o 


(5.22^) 


for  each  i e I.  Moreover,  when  any  or  all  of  the  functions  f , f ^ (i  e I) 

o i 

-ire  polyhedral,  the  corresponding  conditions  (5.21),  (5.22^)  can  be 
weakened  as  follows: 


X G dom  f , 
o o 


(5.21') 


fi(Xo)  < 0 . 


PROOF.  Proposition  6 specialized  according  to  (5.1)  yields  a 

Kuhn-Tucker  vector  (y^»y^,Tl^)  for  (Pg)  • By  the  implication  (b^)  =*  (a) 

of  Proposition  8,  the  component  is  a Kuhn-Tucker  vector  for  (P  ). 

o 

We  conclude  this  section  with  a brief  discussion  of  the  case  in 
which  all  of  the  constraint  functions  are  (translates  of)  homogeneous 
functions.  That  is,  we  assume  now  that  each  f^  has  the  form 


(5.22!) 

1 


f^(x^)  = h^(x^-a^)  - a^,  (5.23) 

where  h^  is  positively  homogeneous  of  degree  one  and  is  assumed  to  be 
closed.  Now  it  is  easy  to  check  that  a closed  proper  convex  function  is 
homogeneous  if  and  only  if  its  conjugate  is  an  indicator  (l.e.  assumes  only 
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the  values  0 or  +°°  ) . Hence, 


h.*  = 


for  some  nonempty  closed  convex  set  D^,  and  in  fact  one  has 

D.  = {y  € Y |<x  ,y  > < h (x  ),  G X } 

X o o'  o o — i o o o 


= ah^(O) 


(cf.  [19,  Corollary  13. 2. 1]) . From  this  it  is  easy  to  compute  that 


(fi  d/^o^  Vi 


(5.24) 


1 1 

tor  any  ^ 0.  Here,  for  notational  convenience  we  use  the  value 
* 0 to  represent  the  case  of  0^,  i.e. 

(ffVxyo)  = (y^)  + <a.,y^>. 

i 

Due  to  formulas  (5.23)  and  (5.24),  the  various  problems  treated 
earlier  in  this  section  assume  the  following  form: 


(D  ) 
o 


min{f^(x^)  |h^(x^-a^)  £ a^,  ^ 

max  { inf  (f  (x  ) + I n,.  [h  (x  -a  )-  a,]}}; 
nl>0  ^o^C  o o j 1 X o 1 i 

-min{f^*(y^)  + I[<a^.yo>  + n^a^]|n^  >0,  y^  + Jy^  = 0,  y^  e x/  i e i}, 


where  ri.D.  is  0 D.  whenever  n.  = 0: 

i X X i 


same  as  (D  ) , except  for  n.D.  being 

(Dg)  • ® ^ ^ 

interpreted  as  OD^  = {O}  whenever  = 0 . 

In  a recent  paper  [2],  C.  R.  Glassey  essentially  argued  that  solving 
(D  ) is  equivalent  to  solving  the  projection  of  (D^)  onto  R , i.e.  solving 
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for  those  n'*’  ^ 0 satisfying 


val(D^)  = -inf{f^  (y^)  + It<a^,y^>  + ^^“^11x0  Ixi  = 0.  X^  ^ n.D^ 


n , D . , t/  i t i 


The  proof  given,  treating  under  certain  assumptions  the  case  in  which 
= r",  the  a^'s  are  all  zero,  and  f^  is  linear,  has  gaps  however 
(see  [2,  p.  181]),  and  is  conclusive  only  for  functions  h^  which  are 
everywhere  finite.  (E.g.  the  formula  derived  just  above  equation  (5a) 
requires  C = in  our  terminology.)  The  difficulties  which  can  arise 
when  C ^ X^,  or  more  generally  when  (5.6)  fails,  are  Illustrated  by 
the  following  examples. 

2 

EXAMPLE  1.  Suppose  X^  = R and  I = {l,2},  and  let  f^,  f^, 
be  as  follows: 


o o 


^2  (Vq  e R), 

-2(qC2)^  if  1 0 

+ «>  otherwise. 


+ 00 


if  > 0 
otherwise. 


This  yields  for  the  primal  problem 


val(P^)  = 0, 


s 

I 


2 

achieved  on  {x  1 C.  > 0 = , and  for  the  ordinary  dual  (note  that  C = R 

o 1 — 2 + 

here), 

val(D  ) = 0, 
o 

achieved  on  ® *12} • Now  it  is  not  hard  to  verify  that  the  conjugates 

of  f ,f,,f„  are  the  indicator  functions 
o 1 2 


1 
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f 0 if  y = (0,1) 
o 

+ <*>  otherwise 
0 If  0 and  nj^n2  ^ i 

+ 00  otherwise  . 

0 if  £ 0 and  ^2  “ 0 
+ oo  otherwise , 

From  this  it  follows  that  for  the  symmetrized  dual, 

val  (Dg)  » 0 , 

achieved  on  { (y^.y^.H^)  In^,  * 0 _<  n2*  Yq  “ (0,1)  » “ (0*0)K  while 

for  the  simplified  symmetrized  dual, 

val  (D  ) = - “>  (infeasible), 
s 

The  latter  means,  f course,  that  the  projection  of  (D^)  onto  the  space  = R 

of  multipliers  ri^  also  has  value  -o®  and  is  infeasible. 

This  example  shows  that  (D^),  and  its  projection,  can  be  hopelessly 

inadequate,  with  in  fact  an  infinite  gap  between  val(D  ) and  val(D^ ), 

s s 

even  when  both  (5.8)  and  (5.8^)  are  satisfied  and  there  exists  a Slater  point. 

Condition  (5.6)  fails  here.  The  next  example  is  a slight  variation,  involving 
a nonlinear  f , illustrating  a finite  gap  between  val(D  ) and  val(D^), 

O 8 S 

where  both  values  are  achieved.  f 

EXAMPLE  2.  Let  everything  be  the  same  as  in  Example  1 except  for 

replacing  the  f there  by 
o 

f (x^)  - e R), 

o o 1 


C^>^o> 


so  that 
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f *(y„) 


n2ln  - n2  if  * o and  n2  > 0 


0 

+ » 


if  = 0 = ri2 
otherwise 


Then  one  can  check  that 


val(P  ) “ 1, 
o 


achieved  on  {x  IC,  > 0 « 

O i — L 


val(D^)  - 1, 


achieved  on  “ 0 £ 112^ 


vai(D  g)  “ i» 


achieved  on  {(y^.y^.H^)  = 0 1 n2.  y^,  “ (O.D  ' “yi*y2  ^ ^2^  ’ 


whereas 


val(D^)  = 0, 


achieved  on  { (y^.y^.n^)  IHj^  = 0 1 ^2*  y^  * (0*0)  = y^^  = 

According  to  Proposition  7,  if  condition  (5.6)  holds  then  solving 
(D^)  is  essentially  equivalent  to  solving  the  simpler  (D  ),  and  if  in 

3 • ’ S 

addition  (5.8)  holds  then  solving  (D  ),  or  its  projection,  is  essentially 

s 

equivalent  to  solving  (D^) • The  pair  of  conditions  (5.6),  (5.8)  is 

weaker  than  the  conditions  imposed  in  [2],  and  furthermore  does  not  require 

linearity  or  even  homogeneity  of  f^.  When  (5.6)  cannot  be  conveniently 

arranged,  though  (as  for  example  when  f^  is  linear  and  the  h^'s  are  not 

everywhere  finite),  then  according  to  Proposition  7(b^)  we  could  still 

use  (D^)  , or  its  projection  onto  R^,  as  a satisfactory  dual  provided 
8 

(5.8^)  is  satisfied. 
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6.  Tlie  ordinary  model  as  the  projection  of  the  symmetrized  model. 

In  the  last  section  we  saw  that  the  problems  (P  ) , (D  ) , (L  ) in  the 

o '*  o 

ordinary  duality  model  can  each  be  regarded  as  essentially  a projection  of 

the  corresponding  problem  (P  ) , (D  ) (or  (D^)),  (L  ) in  the  symmetrized 

s s s s 

duality  model.  We  shall  now  show  that  in  fact  the  entire  ordinary  problem 
trio  collectively  is  the  projection  of  the  entire  symmetrized  problem  trio. 

This  we  do  by  showing  that  the  three  projection  transformations  underlying 
the  phenomena  in  §5  are  interrelated  in  a certain  well  prescribed  way, 
much  as  are  the  three  function  F,  G,  H which  characterize  the  three 
problems  in  a perturbational  duality  model. 

The  projection  transformations  involved  are  linear  transformations 
which  are  not  everywhere  defined.  We  find  it  appropriate  to  view  them  as 
oriented  convex  processes  (see  Rockafellar  [19,  §39]  for  definitions). 
Specifically,  we  shall  exhibit  three  oriented  convex  processes,  call  them 
M,  N,  L,  which  are  interrelated  by  the  adjoint  operation  of  [19]  and  which 
satisfy  the  relations 

MF  = F , NG  = G , LH  = H . (6.1) 

o o o 

The  left-hand  sides  of  these  relations  represent  certain  operations  of  forming 

Images  of  (convex,  concave,  or  convex-concave)  functions  under  (variously 

oriented)  convex  processes;  these  operations  will  be  explained  as  we  go. 

Here  and  throughout,  the  functions  F , G , H are  those  of  (5.3),  (5.5),  (5.4) 

o o o 

while  the  functions  F,  G,  H are  those  of  (4.1),  (4.3),  (4.5)  specialized 
according  to  (5.1).  Each  of  the  three  relations  in  (6.1)  expresses  one  of 
the  individual  projection  phenomena  of  §5.  It  is  the  further  fact  that 
M,  N,  L are  interrelated  by  means  of  the  adjoint  operation  which  corresponds 
to  our  assertion  that  the  whole  symmetrized  trio  projects  onto  the  whole 
ordinary  trio.  Viewed  another  way,  the  results  to  follow  establish  under 
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mild  conditions  the  conmutativity  of  certain  constructions  involving 

taking  conjugates  of  functions,  adjoints  of  processes,  and  images  of  functions 

under  processes. 

Let  M:  X X U -►  X be  the  sup-oriented  convex  process 

given  by 


M(x,u) 


{(x  ,p^)} 
o 


if  u = 0 and  u^  = 0 
o 

otherwise 


(6.2) 


Since  F is  convex  and  M is  sup-oriented,  the  function 

MF:  X^  X 1,  called  the  image  of  F under  M,  is  defined 

[19]  by 


(MF)(x^,M^)  * inf {F(x,u) 1 (x,u)  GM  ^(x^,y^)}. 


(6.3) 


It  is  not  hard  to  show  that  this  construction  results  in  a convex 
function.  What  we  wish  to  note  is  the  following. 

PROPOSITION  13.  The  idenity  MF  * F^  holds  without  any 


additional  assumption. 

PROOF.  We  can  simply  compute  that 

(MF)(x  ,y^)  = inf  {f(x  ,X^C^)  + 4;  (x  ,X^,C^  + y^)} 

° I .1  ^ ° 

X 

* inf  tfpCXp)  + I ^i 


= f„(x  ) + I 4»  (x  ,-y  ) 
o o j i o 1 


Here  we  have  used  (6.3),  (6.2),  (4.1),  (5.1),  (2.1)  and  (5.3). 
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It  is  not  hard  to  calculate  from  the  definitions  in  [19]  that 

the  inverse  of  the  adjoint  of  M is  the  sup-oriented  convex  process 

*-l  I 

M :VxY-»-V^xR  given  by 


M*“^(v,y) 


{(v  if  v^  = 0 and  = 0 

o 

0 otherwise 


Now  define  an  inf -oriented  convex  process  NrY^V-^R^^V  by  switching 

*-l 

the  order  of  variables  in  M and  reversing  its  orientation; 


N(y,v)  = 


((n^.v  )}  if  v^  = 0 and  = 0 


0 


(6.4) 


otherwise 


Since  G is  concave  and  N is  inf-oriented,  the  function  NG:  R^  x 
called  the  image  of  G under  N,  is  defined  [19]  by 


(NG)(n^,v^)  = sup  {G(y,v) 1 (y,v)  e N ^(n^,v^)}.  (6.5) 

Again,  it  is  not  hard  to  show  that  this  construction  results  in  a concave 
function.  What  is  of  interest  here  is  the  following. 

PROPOSITION  14.  If  (5.8"*”)  holds,  then  NG  = G^.  Moreover,  this 

A ^ 1 

identity  is  equivalent  to  the  identity  (MF)  = M ~ F .. 

PROOF.  Using  (6.5), (6.4), (4.3)  and  (2.6)  yields 


(NG)(n  ,v^) 


suPi{-';^j^*(yo,y^,n^)  - f*(yjj+v^,y^,n^)} 


rfy  . o’  ■ I 


By  (2.8)  this  equals 


-“  when  > 0 falls,  while  if  > 0 holds  it 


equals 
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sup  - I (f/n-Xy^)}. 

+)!y.=o  I 


where  is  0 when  = 0.  (It  can  be  shown,  incidentally, 

that  this  quantity  is  at  most  even  without  (5.8^).)  Assuming 

that  (5.8^)  holds,  we  therefore  have  that,  for  > 0, 


(NG)(n  ,V^)  = - inf  {f  (y  ) + ^(f  n,)(y.)} 

yo*l^±-''o  I 

' - + 1 

X V/  X 

O 

= G (r)^,v  ), 
o o 


where  the  function  f . is  when  n . “ 0 and  we  have  used  (5.5). 

1 1 X 

For  the  equivalence  assertion,  note  first  that 


-G^(V,v^)  * F *(v^,n^)  = (MP)*(v  ,n^) 
o o o o o 


by  [21,  eq.  (4.17)]  and  Proposition  13.  Using  [21,  eq.  (4.17)]  again,  we  have 


•(NG)(-n  ,v^) 


- sup{G(y,v) I (y,v)  e N ^(-n^,v^)} 


- sup{G(y^,y^,-n^,v^,o,o)  iy^^.y^} 

- sup{G(-y  ,-y^,-n^,v  ,0,0) |y„,y^} 


inf{F*(v^,0,0,y^,y^,n^) 


inf{F  (v,y)l(v,y)  € M (v^,n^)} 


*-l  *v  . I 
(M  -^F  )(v^,n  ), 
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The  equivalence  of  the  identities  is  immediate  from  these  facts. 

Propositions  13  and  14  show  that  when  (5.8^)  holds  the  following 
diagram  commutes: 


* in  (x,u) 

F G 


M 


* 


in  (xo,y  ) 


The  * here  denotes  conjugacy,  modulo  minus  signs  (see  [21,  eq.  (4.17)]). 
Note  also  that  N is  just  the  inverse  adjoint  of  M,  modulo  orientation. 

It  remains  to  establish  the  third  relation  in  (6.1).  For  this, 
consider  the  product  transformation  L:  X x Y ->•  given  by 


L - X l‘2y 


(6.6) 


where  X -►  X^  is  the  sup-oriented  convex  process 


L.  (x)  = {x  } 
1 o 


and  L2:Y  R is  the  inf-oriented  convex  process 


L2(y)  = 


(6.7) 


(6.8) 


We  have  used  the  singleton  set  notation  here  to  emphasize  that  we're 
regarding  these  (ordinary  projection)  transformations  as  convex  processes. 
This  is  conceptually  helpful  later,  when  we  consider  the  (not  everywhere 
defined)  inverse  adjoints  and  corresponding  orientations. 

What  we  wish  to  do  now  is  to  form  the  image  of  H under  L, 
much  as  we  did  previously  with  M and  F in  (6.3)  and  with  N and  G 
in  (6.5).  There  is  now,  however,  the  ambiguity  of  whether  we  should  take 


or 


J- (x  ,n  ) = inf 
i o 


-1 


64 


sup  H(x,y) 
-1  I 


xSLj^  x^  y€L2 
inf {sup{H(x,y)  I y e L2^n^}lxe 


J2(x^,n  ) 


sup 

^7-1  1" 

y€L2  n 


inf  H(x,y) 

X 

1 o 


sup{inf{H(x,y)  ] X e L^^x^}|ye 


Our  final  result  shows  that  it  usually  doesn't  matter  in  which  order  these 

extrema  are  taken,  and  that  moreover  the  result  is 

PROPOSITION  15.  The  idenitity  J,  = H holds  without  any 

1 o 

additional  assumption.  If  (5.6)  holds  and  each  for  i G I is  closed, 

then  also  J_  = H holds. 

2 o 

PROOF.  Using  (6.7)  and  (6.8),  we  have 

Jl(x  ,n^)  = inf  supj  {H(x^,x^,C^,y^,y^,U^)} . 

By  (4.5)  this  equals  + <»  when  x^  ^ *^0’  ^ ^o 


continue: 


inf  sup,  {f  (x  ) - 'L*(y)  - <x,y>} 


^il-ffCXi)  y^.y 
each  l€  I 


inf  sup  {f  (x  )-[<x  ,y  > + )!<x  ,y  >]- 

iV  n 00  00  Lll 


each  ie  I 


For  any  choice  of  x, 's  which  fails  the  condition  x.  • x , V i ^ I»  the 
^ i i o 

term  in  brackets  can  be  used  to  drive  the  supremum  to  + »,  Hence 
we  can  continue: 


inf  {fQ(xQ)  - 1 ■^i^’^o^’  V i e I } 
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Now  this  inf Imum  equals  + “ vacuously  when  9 for  some  i S i ; 

when  X e c,  it  equals  - » if  any  n.  < 0,  and  it  equals  f (x  ) + Vn . f . (x  ) 
o 1 oo'iio 

I ^ 

if  ri  ^0.  Summarizing  all  this,  we  have 


f f ) 

I o o j i i o 


Ji(Xo.n  ) 


if  X e C and  n >0 

O — 

if  X e c and  i 0 

n ^ 


+ 00 


Ho(x^.n  ). 


if  X ? C , 
o 


where  the  last  equality  is  by  (5. A).  On  the  other  hand, 

= supj.  inf  {H(x  ,x^,C^,y  ,y^,n^)}. 

y^.y  x^,5^ 

By  (A. 5)  this  equals  + «>  when  x ^ C , while  if  x € c we  can 

o o o o 

continue 


= sup  inf 

each  i € I 


= sup 


each  i€  l 


inf 


If  any  <0,  then  for  all  y ,y^  the  term  - Jc.r).  can  be  used  to 
1 o J 1 1 

drive  the  inf imum  to  - “ , so  that  the  overall  value  is  - “ in  this  case. 
If  ^ 0,  we  can  continue: 


66 


* sup  inf  {f  (x  )-<x  ,y  > - T<x.,y  >+  Jn-f-Cx. 

oo  o o 

y<,+&i-o 

yo+Iyi=o 

- sup  + I <\»yi>-  I (fi*ni)(y^)} 

yi  I I 

'ft  ft  ^ 

In  this  calculation,  for  * 0 we  interpret  f^  ri^  as  f^  0 and 

ri^  cl  f^  as  ^ (by  Lemma  1 of  §3).  This  information  is  summarized  by 


f + J*"!  Pi 

f )(x  ) if 

1 o 

X 

o 

e c 

o 

and 

> 0 

J-(x  ,ri^) 
z o 

- 

X 

CD 

if 

X 

0 

e c 

o 

and 

i 0 

+ 

00 

if 

X 

? c 

y 

o 

o 

cl  f^  is 

♦cl 

when  ri^  = 0. 

If  (5.6)  holds. 

then 

C = 
o 

C and 

is  zero 

on 

c . 

o 

If  also  each 

fj.  is  closed. 

it 

follows  from 

(5.4)  that  J.  coincides  with  H . 

z o 

Proposition  15  justifies  the  relation  LH  = H^,  where  it  is  immaterial 
in  which  order  the  extrema  are  taken,  provided  (5.6)  holds  and  the 
closed.  This  completes  the  remaining  part  of  (6.1). 

On  the  strength  of  Propositions  13,  14  and  15,  we  have  that  the 


following  diagram  commutes: 
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*1 


"1 


in  u 


in  X 


* . I V 
F ilLJi — 


* in 


-:^G 

i 


The  here  denotes  partial  conjugacy  in  the  arguments  indicated,  modulo 

minus  signs  (see[21,  equation  (4.2)  and  (4.15)]).  In  fact,  there  is  among 

M,  L,  N a counterpart  to  the  partial  conjugacy  relations  among  F,  H,  G. 

* -1 

To  see  this,  noL?  from  (6.7)  that  the  sup-oriented  convex  process 
is  given  by 

*-l. 


'(v)  = 


{v  } if  v^  = 0 and  “ 0 
o 


0 


otherwise 


(6.9) 


1 

and  from  (6.8)  that  the  inf-oriented  convex  process  L2  is  given  by 


*-l 

L2  (u)  = 


if  u = 0 


and  u = 0 


0 


otherwise 


*-l  * -1 

Now  reversing  the  orientations  of  both  and  L2  , we  have  from 

(6.2),  (6.7)  and  (6.10)  that 


(6.10) 


*_1 

M = X L2 


(6.11) 


and  from  (6.4),  (6.8)  and  (6.9)  that 


*-l 

N = L2  X 


(6.12) 


It  follows  from  (6.11),  (6.6)  and  (6.12)  that  the  inverse  adjoint  operation 
on  a process  is  the  analogue  of  taking  the  partial  conjugate  of  a function, 
and  orientation  reversal  of  a process  is  the  counterpart  of  placing  a 
minus  sign  before  a function. 

Another  illustration  of  the  phenomenon  of  an  entire  duality  model 
projecting  onto  another  appears  in  [4,  §6]. 
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